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Abstract 

Immunization protocols based on priming with plasmid DNA and boosting with recombinants of vaccinia virus (rVV) encoding the 
same antigen offer great promise forthe prevention and treatment of many parasitic and viral infections for which conventional vaccination 
has little or no effect. To overcome some of the potential problems associated to the use of piasmids, we have developed minimalistic, 
immunogenically defined, gene expression (MIDGE®) vectors. These linear vectors contain only the minimum sequence required for gene 
expression and can be chemically modified to increase the immune response. Here, we demonstrate that MIDGE vectors coding for the 
LACK antigen confer a highly effective protection against Leishmania infection in susceptible Balb/c mice. Protection is achieved at lower 
doses of vector compared to conventional piasmids. This efficacy could be greatly improved by the addition of a nuclear localization signal 
(NLS) peptide to the end of the MIDGE vector. In fact, immunization with two doses of NLS-modified MIDGE conferred similar or even 
better protection than that achieved by priming with plasmid DNA followed by boosting with rVV. These results demonstrate that MIDGE 
vectors are a good alternative to plasmid and rV V for immunization. 
© 2002 Elsevier Science Ltd. All rights reserved. 
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1. Introduction 

Leishmaniasis is a disease with different clinical manifes- 
tations produced by Trypanosomatiaae of the genus Leish- 
mania. Depending on the host immune response, the strain 
and the virulence of the parasite, the clinical manifestations 
extend from self-healing cutaneous lesions to the visceral 
form of the disease, the latter being fatal if untreated. World- 
wide there are 2 million new Leishmania cases each year 
and the disease is endemic through parts of Africa, southern 
Europe and Central and South America [1]. With the advent 
of the HTV* epidemic, leishmaniasis has surged as a reacti- 
vated infection in AIDS patients in many countries. More- 
over, in Spain and the southwest of Europe leishmaniasis 
is zoonotic and dogs are the main reservoir host Epidemi- 

Abbrevlations; MIDGE, minimalistic, linear expression vectors; NLS, 
nuclear localization signal 
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ology of the disease revealed that between 10 and 37% of 
dogs in the Mediterranean area are infected and develop the 
visceral form of the disease [2]. 

Eradication of the disease has proven difficult. Chemother- 
apy has only a modest effect and there is no effective and 
safe vaccine against any form of clinical leishmaniasis. 
However, individuals who recovered naturally from infec- 
tion develop strong immunity against re-infection suggesting 
that vaccination against leishmaniasis is feasible. Infection 
of inbred strains of mice with Leishmania major provides 
the best model for the irnmunoregulation that occurs during 
a cell mediated response to this intracellular pathogen [3]. 
Using the Leishmania major model, studies demonstrated 
that the generation of protective immunity against L. major 
is T cell and cytokine mediated [4]. Expansion of a Thl 
subset of CD4 + T lymphocytes secreting IFN-v and IL-12 
is associated with resistance to infection [4]. By contrast, 
susceptible mice expand CD4 + T lymphocytes belonging 
to the Tb.2 subset, which secretes IL-4, IL-10 and 1L-13 
[5]. In human and dog, resistance to visceral leishmaniasis 
is also associated with the generation of a Thl response 
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or a mixture of ThI and Th2 cytokine patterns. Here, the 
absence of a nil response is associated with chronicity of 
the disease and high mortality in untreated cases [4]. Taken 
together these findings show that, as in the mouse model, 
the outcome of the disease appears to be determined and 
regulated by the balance between the two T cell populations 
E6,7]. 

Several antigens have been used in experimental vaccina- 
tion trials in murine leishmaniasis, achieving various levels 
of protection. Among these are L major gp63 [8], gp46 [9], 
p4, p8 [10] and LACK [1 1 .]. The LACK antigen is a 36 KD 
protein highly conserved among related Leishmania species 
and is expressed in both the promastigote and the amastig- 
ote forms of the parasite. LACK is a preferential target for 
the early anti-parasite immune response [12]. The infection 
induces a strong anti-LACK response and the early activated 
LACK reactive cells exhibit a Th2 phenotype [13], Immune 
interventions aimed at modifying the T cell repertoire may 
be used to alter the course of an infection. The option to 
modulate this rapid anti-LACK immune response by exoge- 
nous IL-12 or IFN--y suggests that these cells are not yet 
fully mature Th2 cells and so can be redirected [13]. Fur- 
thermore, vaccination trials with soluble ieishmania antigens 
or a single LACK protein in the presence of recombinant 
IL-12 have been shown to induce a protective ThJ response 
[13, 14]. Thus, it seems that deviation to Thl of the strong 
and specific Th2 immune response against LACK in Balb/c 
mice induces resistance to infection [15]. 

During the last decade, DNA immunization has been pro- 
moted as a new alternative for achieving specific immune 
responses. Protective responses with DNA vaccines against 
several pathogens have been demonstrated [16,1 7], The abil- 
ity of plasmid DNA encoding specific antigens to induce 
both CD4+ and CD8+ T cell responses could be of particu- 
lar use for protection against diseases that require cell medi- 
ated immunity, including leishmaniasis. The low production 
costs and the high structural stability of DNA make DNA 
vaccination a very attractive tool for immunization. On the 
other hand, recombinants of vaccinia virus (rW) express- 
ing different foreign antigens have been successfully used to 
elicit protective immunity to a variety of pathogens [18-21]. 
For instance we have shown that immunization with rW 
expressing the L amazonensis gp46 antigen elicits signif- 
icant protection and long-term immunological memory in 
BALB/c mice [22]. Experiments in mice and other mod- 
els have shown that a combination of these two approaches 
(priming with DNA and boosting with recombinant vaccinia 
expressing the same protein) is associated with the highest 
immunogenicity and protective efficacy against several in- 
fectious agents. Using this approach we have demonstrated 
a good protection against L major in mice with the LACK 
antigen [23], 

Expression plasmids typically used in DNA-based vacci- 
nation usually contain a transcription unit and bacterial se- 
quences necessary for plasmid amplification and selection. 
In order to avoid some of the potential and principal prob- 



lems associated with the use of these plasmids, we have de- 
signed mhnmalistic, unmunogenically defined gene expres- 
sion (MIDGE) vectors that contain only the eokaryotic gene 
expression cassette [24]. Thus, MIDGE constructs carry no 
sequence elements other than those needed for gene transfer. 
Their ends provide unique and selective target sites for the 
coupling of modifying molecules. To assess then utility for 
vaccination against leishmaniasis, MIDGE vectors express- 
ing the LACK antigen from L. infantum were constructed. 
A nuclear localization signal (NLS) peptide derived from 
the T antigen of the SV40 virus was covalently linked to 
modification target sites at the ends of the LACK expressing 
MIDGE vectors. NLS peptides conjugated to plasmids have 
been shown to increase transfection efficiency which could 
result in enhanced immunogenicity [25-27]. NLS-modified 
and non-modified MIDGE vectors were investigated in their 
ability to confer protection against challenge infections, and 
compare in this aspect to various schemes of plasmid-based 
and rW vaccinations. 

Here, we show that immunization of mice with MIDGE 
vectors coding for p367LACK, induced the same level of 
protection against challenge with L. major than equimolar 
amounts of plasmid coding for the same antigen. Most im- 
portantly, two doses of NLS-modified MIDGE conferred full 
protection against the same antigen, resembling the results 
obtained by priming wiui plasmid DNA and boosting with 
rW, both expressing LACK. 



2. Material and methods 

2.1. Construction of plasmids, MIDGEs and modified 
MIDGEs 

The cDNA encoding the LACK protein from L infan- 
tum was obtained as described previously [1 1] and inserted 
downstream of the CMV promoter in the EcoRV site of 
the pcDNA3.1 expression vector (Invitrogen, San Diego, 
CA). The coding sequence of LACK/p36 was subcloned 
into the Sstt. and Kpnl sites of the pMOK plasmid (Molo- 
gen, Berlin, Germany) generating the plasmid pMOK-p36. 
The MTDGE-p36 construct was derived from the pMOK-p36 
plasmid after complete digestion with Eco31l (MBI Fermen- 
tas, Vilnius, Lithuania), The ends were used to ligate hair- 
pin oligodeoxyribonucleotides (ODNs) with T4 DNA lig- 
ase. The mixture was concentrated and heated with Eco3R 
and T7 DNA polymerase in the absence of deoxyribonu- 
cleotides. The DNA was purified by anionic exchange col- 
umn chromatography (Merck EMD-DMAE, sodium phos- 
phate P H 7.0-0.1 M NaCl). 

The NLS peptide (PKKKRKVEDPYC, a generous gift 
from Dr. P. Henklein, Charite, Berlin, Germany) was cou- 
pled to one of the hairpin ODN in two steps as previously 
described [24]. The resulting NLS-coupled ODN was pu- 
rified. The plasmid pMOK encoding the surface antigen of 
the hepatitis B virus (HBsAg) was used as a control. 
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2.2. Construction of recombinant vaccinia 
viruses (rW) 

The cDNA encoding p36 from L infantum was cloned into 
the pSCll W insertion plasmid under control of the p7.5 
early/late viral promoter in the thymidine kinase (TK) locus. 
This plasmid contains the E.coli p-galactosidase gene under 
the control of the pi 1 viral late promoter (pSCp36) . Vaccinia 
virus (VV) recombinants were derived from the wild type 
Western Reserve (WR) strain, and rWp36 was prepared by 
transfecting with the insertion plasmid WR-infected BSC -40 
cells [20). The recombinant viruses were harvested 48-72 h 
post-infection and selected after plaque assay by addition 
of X-Gai to the agar. p-Galactosidase-producing plaques 
picked and re-plated three times and amplified following 
standard procedures [23,28]. 

2.3. Cells 

African green monkey cells (BSC -40) and HeLa cells 
were cultured in Dulbecco's modified Eagle medium (DM- 
EM) supplemented with 10% newborn calf serum (NCS, 
Gibco BRL, Paisley, UK). Viruses were grown in HeLa 
and titrated in BSC40 cells. COS-7 cells were cultured in 
Dulbecco's modified Eagle medium (DMEM) supplemented 
with 10% fetal calf serum (FCS, Gibco BRL, Paisley, UK). 

2.4. Mice 

Female 6-8-week-old BALB/c mice, maintained under 
pathogen-free conditions, were obtained from the facilities 
at the Centra Nacional de Biotecnologi'a (Madrid). 

2.5. Parasites and reagents 

L major (WHOM/TR/-173) was a kind gift from Dr. 
N. Glaichenhaus (CNRS, Valbonne, France). Promastig- 
otes were cultured at 27 a C in Schneiders medium (Gibco 
BRL, UK) supplemented with 20% fetal calf serum and 
antibiotics. Parasites were expanded in BALB/c mice and 
L major amastigotes were obtained from a hind footpad 
lesion. After transformation from amastigotes to promastig- 
otes, a synchronized culture was established for 9 days until 
parasites were in a late stationary phase. 

2.6. LACK/ p36 expression in transiently 
transfected cells 

3 x 10 5 COS-7 cells were transiently transfected 
with equimolar amounts of the different DNA vectors 
(pMOK-p36: 4u.g; MTDGE*p36 and MTDGE-NLS-p36: 
2.2 jig) using Lipofectamine transfection reagent (Gibco 
BRL, UK). One day later expression of LACK/p36 was 
detected by Western blot analysis of whole cell lysates. The 
Western blot was reacted with an anti-p36 rabbit polyclonal 
antibody as previously described (23). For band quantifica- 
tion, the Western blot was digitized and analyzed with the 



NIHimage program (National Institutes of Health, Bethesda 
Maryland). After equalization of the image, the mean den- 
sity was measured for each band using a selection of fixed 
area and shape. 

2.7. Immunization and infectious challenge 

Groups of BALB/c mice (n = 10-11) were primed in- 
tradermally (i.d.) in the back with 100 u.g per mouse of 
pMOK-p36 or pMOK-HBsAg in 100 p,l volume. The groups 
primed with MTDGE-p36 and MtDGE-p36-NLS received 
54.8 ug per mouse which is an equimolar concentration of 
the plasmids (Table 1). Two weeks later (14 d.p.i.) mice 
were boosted either i.d. in the back with the same amount 
of DNA or intraperitoneally (i.p.) with rWp3 6 (5 x 10 7 pru 
per mouse). The non-immunized group received the same 
volume of buffer (150 mM sodium phosphate). Three weeks 
after boosting (34 d.p.L), three mice per group were sacri- 
ficed and sera were obtained. The following day (35 d.p.i.), 
mice (n = 7-8) were challenged subcutaneously in the right 
hind footpad with 5 x 10 4 live stationary phase L. major 
promastigotes. Lesion development at the inoculation site 
was measured weekly with a digital caliper (Mauser Digi- 
tal, Switzerland) and expressed as the increase in thickness 
of infected versus uninfected hind foot Mice were sacri- 
ficed at week 8 post-challenge and serum, lymph nodes and 
spleen collected. 

2.8. Evaluation of cytokine production 

Cytokine levels in cell culture supematants were deter- 
mined by BUS A. Single cell preparations from spleens were 
plated in triplicate at4 x 10 6 cells/ml in 24 well plates (Nunc, 
Denmark). Soluble antigenic peptide (2y.g/ml) (a kind gift 
from Dr. N. Glaichenhaus [29]), p36 protein (2 jig/ml) pre- 
pared as previously described [11], soluble leishmanial anti- 
gen (4jxg/ml) and ConA (2p,g/ml) were added in a final 
volume of 2 ml/well, Supematants were harvested after 48 
and 72 h and stored at -80 °C until used. IFN-y and IL-4 
levels were assessed by specific ELISA using capture and 

Table 1 
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secondary antibodies from Pharmingen (Bee ton Dickinson, 
USA) following the manufacturer's instructions. 

2.9. Detection of total IgG antibodies against p36 

One day before challenge and 7 weeks post-challenge, 
serum was collected from each group of animals and spe- 
cific anti-p36 IgG antibodies and their isotypes {IgGl and 
IgG2a) were analyzed by ELISA. In brief, 96-well Max- 
isorp plates (Nunc, Denmark) were coaled overnight at4°C 
with recombinant p36 (5 y,g/rnl) prepared as previously de- 
scribed [1 1]. Serum samples were diluted 5- and 10-fbld in 
blocking buffer (1% BSA in PBS-T), added in 50uJ/well 
and incubated lh at 37 °C. Peroxidase-conjugated goat 
anti-mouse total IgG, IgGl or IgG2a (Southern Biotechnol- 
ogy Associated, Birmingham, AL) was added and incubated 
for 1 h at 37 °C. Plates were then reacted with peroxidase 
substrate OPD (Sigma, St Louis, MO) and absorbance was 



read at 492 am on a Labsystem Multiskan Plus plate reader 
(Tecan Maguellan, Sunrises). 

3. Results 

3.1. LACK/p36 encoding MIDGE vectors 

The p36 encoding sequence was inserted into pMOK gen- 
erating the plasmid pMOK-p36 (Fig- IA). In this vector the 
LACK/p36 protein is expressed under the control of pro- 
moter and enhancer sequences from the immediate early re- 
gion of HCMV. LACK/p36 expression was improved in the 
pMOK vector by introducing an intron sequence between 
the CMV promoter and the LACK/p36-encoding sequence. 
pMOK-p36 contains bacterial sequences required for plas- 
mid amplification and selection (Kan R ). 

The MIDGE-p36 construct was generated from the 
plasmid pMOK-p36 as described in Section 2 (Fig. I A). 




Fig. 1. Construction of MIDGE vectors and expression of LACK/p36 from different constructs. (A) The LACK-encoding pMOK plasmid was used to 
generate the LACK-encoding MIDGE constructs. By using NLS modified hairpin ODN we generated the MUX3E-p36-NLS construct; (B) Expression 
of LACK/p36 protein in transiently transacted COS-7 cells. Cells were transiently teansfeeted with equimolar amounts of DNA (pMOK-p36 and 
pMOK-HbsAg: 4|xg; MIDGE-p36 and MIDGE-p36-NLS: 2.2 |ig) using lipofcttauime tainsfecticn reagent, The expression of the protein was delected 
by Western blot analysis of lysate cells. One representative experiment is shown. Lane I: pMOK-HBsAg; lane 2; MTDGE-p36-NLS; lane 3: MIDGE-p36; 
lane 4; pMOK-p36; (C) The figure shows me result of band densitometry expressed in arbitrary units (see Section 2). 
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MIDGE-p36 is a linear construct with covalently-closed 
ends. This expression unit contains only the promoter, in- 
tron, LACK/p36 encoding, and polyadenylation sequences 
(Fig. I A). The linearity of the construct was confirmed by 
digestion with an appropriate restriction enzyme that has a 
single recognition site in the construct and generated two 
DNA fragments of the expected size (data not shown). In 
order to increase transfection efficiency and the expression 
of the antigen, we linked to the LACK expressing MIDGE 
vector a nuclear localization sequence signal (NLS) derived 
from the T antigen of the SV40 virus. The NLS peptide 
was conpled to one of the hairpin ODN in two steps as 
previously described [24]. 

3.2. Efficient LACK/p36 expression in cells transiently 
transfected with £ACK/p36-encoding plasmid, 
MIDGE and M1DCE-NLS 

Transient expression of p36 from plasmid pMOK-p36, 
the corresponding MIDGE construct and MTDGE-modified 
with the NLS sequence was tested in vitro in COS-7 cells 
transfected with DNA. The transfection was done using 
equimolar concentrations of each construct and the level 
of LACK/p36 expression was measured in lysed cells 
(Fig. IB). The MIDGE construct expressed about six-fold 
less LACK/p36 protein than the plasmid (Fig. IB and C). 
By contrast, NLS-modified MIDGE expressed a higher 
level of protein than unmodified MIDGE, but two-fold less 
than plasmid (Fig. 1C). 

3.3. Efficacy of MIDGE vectors to confer protection 
against leishmaniasis 

The efficacy of MIDGE vectors to trigger a protective 
immune response was tested in a murine model of Leish- 



mania infection. For these studies, we compared protocols 
based on the use of a conventional plasmid and simple or 
NLS-modified MIDGE vectors (Table 1). Mice were primed 
with different constructs encoding the p36 antigen from 
L infantum. Fourteen days later, half of the groups were 
boosted with rWp36 and the others with the three different 
DNA vectors (Table 1). Three weeks after boosting, all ani- 
mals were challenged with 5 x 10* late stationary phase L. 
major prpmastigotes in the right hind footpad (see Section 
2). Progression and size of the lesion were measured weekly. 
Mice were sacrificed 8 weeks after challenge because of the 
large size of the lesion in control groups. 

3.3.1. Priming/boosting with MIDGE vectors confers 
similar protection against L. major challenge than 
priming/boosting with a classical plasmid 

All immunized groups showed several degrees of protec- 
tion along the 8 weeks following the challenge (Fig. 2A). 
The protocol based on priming and boosting with MIDGE 
vector induced the same level of protection than that ob- 
served with the protocol based on priming and boosting with 
plasmid (Fig. 2A). Thus, both groups presented a reduction 
in lesion size of approximately 45% 8 weeks after challenge 
(Table 2). 

3.3.2. Priming/boosting with MIDGE-p36-NLS confers 
high protection against L. major im*ection 

When we compared the above protocols with prim- 
ing/boosting with MIDGE-NLS, we observed a higher 
reduction in lesion size with the latter protocol (Fig. 2A), 
The overall protection was almost 80% compared to the 
control group immunized with plasmids coding for an ir- 
relevant antigen (Mann-Whitney test, P = 0.0056) (Table 
2). By following the progression of the disease weekly, 
both MIDGE and MIDGE-NLS showed a good level of 
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Fig. 2. Course of L major infection in mice. (A) lb examim 
challenged s.c. in the right hind footpad with 5 x 10 4 live stationary phase i 
Lesion size was determined by measuring the thickness of the right footpad » 
the infected and the uninfected collateral footpads. The figure shows the me 
weekly intervals; (B) Lesion development at 8 weeks post-challenge. Black s< 
the mean value for each immunized group.. 
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protection up to week 5 post-challenge (Fig. 2A). However, 
after this time point, the lesion developed in the group 
primed/boosted with MIDGE-p36 while it did not progress 
in the group primed/boosted with MEDGE-p36-NLS. This 
was manifested after examination of lesion size in individ- 
ual animals (Fig. 2B). In fact, 57% (four out of seven) of the 
animals from this group had virtually no lesion (<0.5 mm) 
8 weeks after challenge. 

3.3.3. The protocol based on priming/boosting with 
MIDGE-p36-NLS induces at least as good protection 
against L major than priming with plasmid DNA vector 
and boosting with fWp36 

Although vaccination with DNA alone has proven ef- 
fective in animal models, it has been recently shown that 
a protocol based on priming with DNA vector and boost- 
ing with vaccinia virus recombinants expressing differ- 
ent genes induces high protection upon challenge with 
several pathogens including Leishmania [20,23,30-36]. 
We, therefore, compared the efficacy of priming/boosting 



beiween classical plasmid and MIDGE vectors with rW 
expressing p36. As shown in Fig. 2A, the extend of protec- 
tion triggered by the protocol based on prurdng/boosting 
with MIDGE-p36-NLS was similar to that induced by 
priming/boosting with pMOK-p36 and rWp36, show- 
ing not statistically significant differences between both 
groups. At the end of the experiment, the average re- 
duction in lesion size was almost 80% in the group im- 
munized with two doses of MTDGE-p36-NLS (Table 2), 
whereas the group immunized with pMOK-p36/rVVp36 
presented an average reduction of 64% compared to the 
group immunized with plasmids coding for an irrele- 
vant antigen (Mann-Whitney test, P - 0.0056 and P = 
0.0042, respectively) (Table 2). The difference in lesion 
size between MIDGE-p36-NI^/MroGE-p36-NLS and 
pMOK-p36/rVVp36 nnmunized animals was not significant 
(Mann-Whitney test, P - 0.22). 

When used only for priming, both MIDGE and 
MTDGE-NLS showed a good level of protection up to week 
5 post-challenge (Fig. 2A). However, after this time point, 
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the lesion developed very rapidly in the group primed with 
MIDGE-p36 and more progressively in the one primed with 
MDDGE-p36-NLS (Fig. 2 A), showing a reduction in. lesion 
size of 38 and 53%, respectively, 8 weeks after challenge 
as compared to the group immunized with two doses of 
pMOK-HBsAg (Mann-Whitney test, P = 0.52) (Table 2). 

3.4. Reduction in lesion size correlates with 
small lymph nodes 

Reduction in lesion size indicates that parasite replica- 
tion is diminished in the local lymph node. To test that, 
each popliteal lymph node from the infected leg was ex- 
tracted and weighted. Average lymph node weight was sig- 
nificantly lower in the two most protected groups (Table 2). 
In the group immunized twice with MnX>E-p36-NLS the 
average lymph node weight was almost 30% lower than that 
in controls and 40% lower in the group immunized with 
pMO£-p36 and rWp36 (Mann-Whitney test, P - 0.25 
and 0.0023, respectively). 

Interestingly, the group immunized with MIDGE-p36/ 
iVVp36 had the largest lymph node of all, even more than 
controls. This may reflect the rapid growth of me lesion in 
this group over the last 3 weeks of the analysis (Fig. 2A). Al- 
ternatively, a large lymph node could reflect an active lym- 
phocyte proliferation. This possibility could not be ruled out 
directly due to low number of cells recovered from the local 
lymph node. However, phenotypic analyses of splenocytes 
from all groups showed no differences in (he percentage of 
any cell subset or activation market (data not shown). These 
suggested the lack of a general immune activation. 

3.5. Humoral immune responses before L major 
challenge 

Since the outcome of the disease may be determined by 
the extent and the type of the immune response, we decided 
to check the antibody response before the challenge. Three 



weeks after boosting, serum was collected from each group 
of animals and the specific anti-p36 immunoglobulin G lev- 
els and their isotypes (IgGl and IgG2a) were evaluated. As 
positive control, two sera from animals infected with L. ma- 
jor were used. Specific anti-p36 total IgG was only detected 
in sera from animals immunized with protocols based on 
priming with DNAp36 (pMOK, HODGE or MTOGE-NLS) 
and boosting with rWp36 (Fig. 3). The main isotype pro- 
duced was IgG2a indicating a prevalent Thl immune re- 
sponse (Table 2). By contrast, control animals developed a 
characteristic Th2 immune response (Table 2). 

3.6. Humoral and cellular immune responses after L 
major challenge 

Since activation of a Thl type of immune response may 
be required to induce protection against Leishmania infec- 
tion, we were interested in characterizing the changes in 
immunoglobulin isotypes and cytokine production follow- 
ing challenge in animals. Spleens and sera were taken from 
sacrificed mice and specific IgG antibodies, and IFN-7 and 
IL-4 production were evaluated. Analysis of specific IgG 
levels and their isotypes revealed that all groups had devel- 
oped anti-p36 total IgG as expected because all of them had 
been challenged with the parasite. The levels were slightly 
higher in the groups primed and boosted with MTDGE-p36 
and MTDGE-p36-NLS (Fig. 3). Most importantly, the high- 
est overall IgG2a/IgGl ratios were present in the groups mat 
showed the highest levels of protection, indicating a preva- 
lence of a Thl immune response (Table 2). 

In order to further correlate the generation of a Thl re- 
sponse with protection, pooled splenocytes from each group 
were stimulated with either recombinant p36 protein, an im- 
munodominant class n-specific peptide from this protein 
[29], a lysate of Leishmania parasite or Concavalin A as 
positive control. IFN-7 (a Thl cytokine) and EL-4 (Th2) pro- 
duction was determined in the supernatants 72 h after stim- 
ulation by ELISA. No IL-4 could be detected in any of the 




Fig. 3. Specific IgG antibodies before and after challenge with L major. Sera were collected 1 day before challenge with L major parasite and 8 weeks 
post-challenge. Specific anri-p36 IgG antibodies were analysed by ELISA. Black bars represent the mean ± S JD. of three mice per group (pre-challenge) 
and grey bars represent the mean dfc S.D. of 7-8 mice per group (post-challenge). Groups l-« are defined in Table 1 , Group 9 represents serum from a 
at infected with 5 x 10 4 live stationary phase L. major promastigotes (the mean ± S.D. of triplicate wells is presented). Group 10 
n a naive mouse (the mean ± S.D. of triplicate we 
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Fig. 4. IFN-7 production by in vitro stimulated splenocytes. Eight weeks after challenge, spleens were removed and pooled in each group. Cells were 
stimulated in vitro with recombinant LACK protein, the main immunogenic peptide from LACK [14J. soluble leislunanian antigen (SLA) or ConA. 
Seventy-two hours post-stimulation supematants were harvested and IFN-7 production was assessed by specific ELISA. The concentration of IFN--y was 
calculated by extrapolating the absorbance value to a standard curve made with known concentrations, 'values between groups were standardized by using 
the production after stimulation with ConA as the maximum value. Bars represent the percentage of IFN-7 production relative to the production 
after ConA stimulation. Groups 1-8 are those defined in Table 1. NDr not determined. 



groups presumably due to the low sensitivity of the assay. 
By contrast, relatively high levels of IFN-7 were observed 
in all groups after stimulation with LSA. Stimulation with 
the p36 protein or the peptide induced a higher WN~y pro- 
duction in the groups showing the best protection {Fig- ■*)- 



4. Discussion 

DNA immunization represents a novel and interesting 
approach to vaccine development and immunotherapy, 
particularly for applications where cell-mediated immune 
responses are required. Several obstacles are hampering the 
translation of this promising technology to the clinic, includ- 
ing safety concerns associated to the use of conventional 
plasmids and the high doses required to obtain protective 
immunization. We and others have demonstrated that the use 
of plasmid DNA as priming agent followed by boosting with 
recombinants of vaccinia virus coding for the same antigen 
represents a very effective protocol for immunization and 
protection {20,23,30-36]. Although this strategy allows a 
considerable reduction in the amount of DNA required, the 
use of recombinant viruses abrogates some of the advantages 
of DNA vaccination. In the present work we have studied 
the ability of minrmalistic, immunogemcally-defined, gene 
expression vectors to induce protection in a mouse model 
of Leishmania infection. MIDGE vectors offer several sig- 
nificant advantages over conventional plasmids, such as 
small size, absence of antibiotic resistance genes and selec- 
tive target sites for chemical linkage of peptides, proteins, 
sugars, etc. [24]. The efficacy of MIDGE vectors was com- 
pared to that of plasmids in several immunization protocols 
including boosting or not with rVV (Table 1). 



The very conserved LACK antigen from L. infan- 
tum was cloned into both a plain MIDGE vector and a 
MEDGE-modified by covaletitfy-1 inked nuclear localization 
signals (NLS) (Fig. 1A). NLS has been shown to increase 
the translocation of both proteins [37] and DNA [25-27] 
from the cytoplasm to the nucleus thus bypassing one of 
the main barriers for (he expression of foreign DNA. lb test 
whether this increase in nuclear transport correlated with a 
higher antigen expression, COS-7 cells were transfected in 
vitro and LACK expression was detected by Western blot- 
ting. As shown in Fig IB, the addition of one copy of NLS 
resulted in a three-fold increase in antigen expression com- 
pared to plain MIDGE. However, mis expression was still 
lower than that achieved by transaction with an equimolar 
amount of plasmid coding for the same protein, indicating 
that increased transport to the nucleus is not sufficient to 
achieve high antigen expression. 

MIDGE vectors expressing the LACK protein were then 
used for in vivo immunization of Balb/c mice and their ef- 
ficacy in inducing protection was determined by challeng- 
ing the mice with promastigotes from L. major. This is a 
well established model that has been extensively used to 
study the immunological parameters involved in Leishma- 
nia infection and protection as well as for (he validation of 
vaccine candidates against this parasite [3,6]. As shown in 
Fig. 2, two doses of plain MIDGE-p36 confer partial but 
sustained protection against the challenge. This protection 
is identical to that induced by two doses of plasmid, but it 
is achieved with only half the amount of DNA (Table 1). 
The development of the lesion also follows the same kinetic 
in both groups of immunized animals (Fig. 2A) suggesting 
that these vectors induce both quantitatively and qualita- 
tively similar immune responses in spite of the lower anti- 
gen expression observed in MIDGE-transfected cells in vitro 
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(Fig. 1 B). Most importantly, the best protection against chal- 
lenge was observed in the group immunized with two doses 
of NLS-modified MIDGE vector (Fig. 2A). In this group, 
four out of seven mice were completely protected at the ter- 
mination of the study and only one presented a lesion of con- 
siderable size (Fig. 2B). These results were even better than 
those obtained by priming with plasmid and boosting with 
rVV, which is considered to be one of the best immunization 
protocols available 136,38], In addition, both MIDGE-p36 
and MTDGf>p36-NLS conferred a good but short protec- 
tion against challenge with L major when used as priming 
vectors followed by boosting with rWp36 (Fig. 2). Mice 
from these groups were completely protected up to 5 weeks 
post-challenge but from this time the lesion developed very 
rapidly, particularly in the group primed with MDDGE-p36 
(Fig.2A). 

Trying to understand the immunological mechanisms un- 
derlying these results, we analyzed serum from three mice 
in each group for specific anti-LACK IgG antibodies and 
their isotypes at the time of challenge. It has been previously 
shown by many authors that in this model of Leishmania 
infection, there is a clear correlation between resistance to 
infection and the development of a Thl type of response, 
whereas susceptibility correlates with the development of a 
Th2 response [3,5]. Although protection against Leishma- 
nia is not mediated by antibodies [7], IgG isotypes are a 
valid downstream indicator of the type of T-helper immune 
response generated. Specific IgG antibodies were only de- 
tected in those groups receiving rWp36 at the time of boost- 
ing and not in those receiving two doses of DNA (Fig. 3). 
The explanation for these results could be that LACK is a 
cytoplasmic protein that cannot be detected by B cells when 
expressed from a DNA vector. By contrast, Vaccinia virus 
induces lysis of infected cells [21] and so the whole protein 
becomes exposed to the immune system. The IgG antibodies 
produced by mice boosted with rV Vp36 were predominantly 
of the IgG2a isotype (Table 2), indicating the induction of a 
Thl response, as is normally the case when using rW [21 j. 

In contrast to these results, all mice had specific 
anti-LACK antibodies 8 weeks after challenge (Fig. 3), 
implying that replication of die parasite and cell lysis had 
occurred in all of them. This was also suggested by the 
increase in size of the local lymph node at the termina- 
tion of the experiment, which was particularly important in 
those groups with the lowest protection (Table 2) and which 
could not be attributed to a higher immune response (data 
not shown). Interestingly, the groups immunized with two 
doses of any DNA vector had the highest antibody levels. 
This suggests that, although undetectable at the time of the 
analysis, DNA immunization had probably induced a low 
and/or transient production of antibodies that was boosted 
by the infection. However, as mentioned above, antibodies 
do not seem to be involved in protection because their level 
before or after challenge did not correlate with lesion devel- 
opment Specific IgG isotypes were also determined at the 
time of sacrifice. As shown in Table 2, those groups with a 



higher IgG2a/TgGl ratio (indicating a bias towards a Thl 
response) were the best protected at week 8 post-challenge. 
To further correlate protection with the generation of a Thl 
response, pooled splenocytes from every group of mice 
were re-stimulated in vitro with different stimuli and the 
IFN--y production (a Thl cytokine) was measured in the 
supematants. Results were not very conclusive, probably 
because by week 8 after challenge the immune system had 
time to generate a response against many antigens from the 
parasite. As a consequence, the specific anti-LACK cellular 
response, important in the early stages of the infection, may 
be difficult to detect This is demonstrated by the fact that 
splenocytes from most groups were able to produce INF-7 in 
response to a whole Leishmania lysate (Fig. 4). Nonetheless, 
it seems mat only splenocytes from groups showing the best 
protection (group 3: MIDGE-p36-NLS/MIDGE-p36-NLS 
and group 5: pMOK-p367rWp36) produced a good amount 
of IFN-7 in response to LACK (Fig. 4). More experiments 
have to be done to confirm this correlation, preferably soon 
after the challenge because the anti-LACK response nor- 
mally occurs very early after infection and so it may be at 
this point when generation of the right (Thl) response is 
critical for controlling replication of the parasite. 

Altogether, we have shown that MIDGE vectors are 
very efficient at inducing protection against challenge in 
a well-studied murine model of Leishmania infection. Al- 
though in vitro, the plain MIDGE vector induced lower 
protein expression than plasmid, in vivo they both conferred 
similar protection. This suggests that antigen load may not 
be the only limiting factor when vaccinating with DNA and 
that MIDGE vectors have an unknown feature that makes 
them immunologically more efficient Alternatively, it could 
be that MIDGE vectors induce higher protein expression 
in vivo or that the amount of antigen expressed after two 
doses is above the threshold required for die induction of 
an immune response. 

When given only for priming, half the amount of both 
plain MIDGE and NLS-modified NUDGE vectors in combi- 
nation with rW conferred almost full protection during the 
first 5 weeks post-infection. After this time, however, the 
parasite took over and the lesion developed very quickly in 
these two groups. By contrast, when plasmid was used for 
priming, the infection was controlled over a long period of 
time. It is not clear why priming with plasmidDNA followed 
by boosting with recombinant vaccinia vims is such an ef- 
fective protocol for inimunization whereas both used in the 
opposite order are not effective. One of the most plausible 
explanations is that rW given first strongly primes many 
anti-viral T and/or B cells and so dilute the antigen-specific 
ones, which are then very difficult to re-amplify by the DNA 
booster [21—40]. When DNA is given first, the response 
maybe low but specific against the cloned antigen and so 
can be easily amplified by the very immunogenic rVV [40]. 
Our results indicate that to achieve a strong and long-lasting 
immune response with this protocol it may be necessary to 
generate enough antigen during the pruning step. This would 
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explain why plasmid was better than MIDGE vectors as 
priming agent and why the group primed with NLS-modified 
MIDGE contained the infection slightly better than the one 
primed with plain MIDGE. Experiments are underway to 
test Ibis possibility by using larger amounts of MIDGE vec- 
tors to achieve similar antigen load than when priming with 
plasmid. 

The most important outcome from this work was 
the strong protection obtained with only two doses of 
NLS-modified MIDGE coding for p36. To our knowledge, 
this is the first time that an immunization protocol based 
only on DNA gives comparable or even better protection 
than one based on plasmid DNA 4- rW. This result can 
not be attributed to a higher antigen load induced by the 
MIDGE vector system because, as already emphasized, 
plasmid was better for protein expression at least in vitro 
{see above). Most likely, the presence of the NLS peptide 
covalently linked to the DNA is making the vector qualita- 
tively different In fact, when working on a different model, 
we have observed that immunization with NLS-modified 
MIDGE vectors predominantly induce a Thl type of re- 
sponse in mice, even in conditions where other vectors 
induce a Th2 response (Moreno et al., manuscript in prepa- 
ration). The basic mechanism underlying this bias is still 
unclear, but it could explain the good protection conferred 
against Leishmania infection, reported to be highly depen- 
dent on the generation of a Thl response [3], Ahhough 
the design of the present experiment, more focused on 
protection than on basic immunological studies, does not 
allow for a more precise detennination of the type of im- 
mune response generated, the data available suggest that 
the most protected groups had more Thl response. On the 
basis of all the data, we can conclude that a long-lasting 
protection against Leishmania infection from immunization 
with the LACK antigen first requires a certain amount of 
antigen to trigger the specific immune response. Once gen- 
erated, only if the response is of the Thl type, the infection 
will be controlled. 

In summary, the evidence presented here shows that 
MIDGE vectors are a true alternative to plasmids and re- 
combinant viruses for protective rmmunization. MIDGE 
vectors offer great hope for the development of effective 
and safe vaccines against Leishmania and other devastating 
infections for which conventional vaccines have no effect 
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ABSTRACT Translocation of exogenous DNA through the 
nuclear membrane is a major concern of gene delivery tech- 
nologies. To take advantage of the cellular import machinery, 
we have synthesized a capped 3.3-kbp CMVLuciferase-NLS 
gene containing a single nuclear localization signal peptide 
(PKKKRKVEDPYC). Transfection of cells with the tagged 
gene remained effective down to nanogram amounts of DNA. 
Transfection enhancement (10- to 1,000-fold) as a result of the 
signal peptide was observed irrespective of the cationic vector 
or the cell type used. A lysine to threonine mutation of the 
third NLS amino acid completely abolished these remarkable 
features, suggesting importin-mediated translocation. Our 
hypothesis is that the 3-nm-wide DNA present in the cyto- 
plasm is initially docked to and translocated through a 
nuclear pore by the nuclear import machinery. As DNA enters 
the nucleus, it is quickly condensed into a chromatin-like 
structure, which provides a mechanism for threading the 
remaining worm-like molecule through the pore. A single NLS 
signal is thus sufficient, whereas many signals on a gene would 
actually inhibit entry, the same DNA molecule being threaded 
through adjacent pores. 



The nuclear membrane of eukaryotic cells is freely permeable 
to solutes of up to about 9 nm (e.g., 40- 60-kDa proteins). 
Transport of larger molecules through nuclear pores is signal- 
mediated, involves shuttle molecules, and requires energy. The 
basic peptide derived from the simian virus 40 large tumor 
antigen (PKKKRKV) is a nuclear localization signal (NLS) 
that mediates binding of the karyophilic protein to importin a 
(1). Complex formation triggers binding to importin /3 and the 
ternary complex is then carried through the nuclear pore with 
the help of the GTPase Ran. Macromolecules and particles up 
to 25 nm have been shown to enter the nucleus in this way, 
although the translocation mechanism is still not fully under- 
stood (2-5). 

Eukaryotic DNA viruses, which replicate in the nucleus, 
seem to be capable of diverting the cell's nuclear import 
machinery to their own benefit (6, 7). Recombinant viruses 
derived from these viruses are being used to carry therapeutic 
genes into humans. However, the host's immune response is 
currently a limitation to the clinical development of viral gene 
therapy. Nonviral alternatives using pi asmid DNA and cationic 
carrier molecules suffer from a different drawback, the low 
efficacy of gene delivery. The main barrier to transgene 
expression in vitro is the nuclear membrane (8-10). Because 
breakdown of this membrane during cell division helps nuclear 
localization, this obstacle is probably still more a problem in 
vivo, where cells can be considered to be resting with respect 
to the lifetime of DNA. 
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Several attempts to improve entry of plasmid DNA into the 
nucleus have been published. These include use of electrostatic 
binding of DNA to cationic NLS-containing proteins (11, 12) 
or peptides (13) or lipids (14), as well as sequence-specific 
binding of DNA to karyophilic proteins (15-17). Yet such 
"piggyback" nuclear transport relies on the unpredictable 
stability of the complexes within the cytoplasm. Recent work 
by the group of Ion Wolff (18) with digitonin-permeabilized 
ceils demonstrated nuclear accumulation of fluorescently la- 
beled DNA that was randomly tagged with hundreds of NLS 
peptides; nuclei of intact cells did not take up the modified 
DNA Our approach took advantage of a chemically controlled 
pathway to irreversibly link a single NLS-peptide to one end of 
a gene. Moreover, following Heisenberg's principle, we 
avoided taking a too close look at nuclear import using 
potentially disturbing techniques. The goal being gene deliv- 
ery, the level of transgene expression after transfection was 
chosen as the most pertinent test of success. 

EXPERIMENTAL PROCEDURES 

Chemicals, Enzymes, and Oligonucleotides. 4-(7v"-MaIeim- 
idomethyl)cyclohexane-l-carboxylic acid A'-hydroxysuccinim- 
ide ester (SMCC) was purchased from Sigma. The SAL 34-mer 
5'-d(TCGATGTCCGCGTTGGCTTXTGCCAACGCGGA- 
CA) oiigodeoxynucleotide was synthesized by Appligene by 
using an amino-modified deoxythymidine (X; amino-modified 
dT, Glen Research, Sterling, VA). The XMA 34-mer 5'-d(C- 
CGGCTACCTTGCGAGCTTTTGCTCGCAAGGTAG) 
oiigodeoxynucleotide, the NLS peptide NH 2 -PKKKRKVED- 
PYC, and the mutated-NLS peptide NH 2 -PKTKRKVEDPYC 
with C-terminal amidation were synthesized by Genosys (The 
Woodlands, TX), Hairpin structures composed of a loop of 
four thymines, a stem of 13 bp, and a sticky 5' end were formed 
by boiling and subsequently cooling the XMA or the SAL 
oligonucleotides in ice. Xmal, Xmnl, Sail, and BspBl restric- 
tion endonucleases, T4 polynucleotide kinase, T4 DNA ligase, 
and exonuclease III were purchased from New England Bio- 
labs. Linear 22-kDa (ExGen500) and branched 25-kDa poly- 
ethylenimines (PEI) were purchased from Euromedex 
(Souffelweyersheim, France) and Fluka (Saint-Quentin Fal- 
lavier, France), respectively. Transfectam (dioctadecylamido 
glycylspermine) was synthesized as described (19). 

Preparation of the Oligonucleotide-Peptide Conjugate. Two 
A^o units (6.46 nmol, assuming e 2 eo = 309,800 M _1 -cm _1 ) of 
the amino-modified SAL oligonucleotide in 20 jxl of phosphate 
buffer [10 mM sodium phosphate (pH 7.5)] was mixed with a 
40:1 molar excess of the bifunctional crosslinker SMCC in 
dimethylformamide (as a 30 mM stock solution) and incubated 



Abbrevialions: NLS, nuclear localization signal; SMCC, 4-(Af- 
maleimidomethyl)cyclohexane-l-carboxylic acid iV-hydroxysuccinim- 
ide ester; PEI, polyethylenimme; N/P, nitrogen/phosphate. 
*M.A.Z. and P.B.-V. contributed equally to this work. 
tTo whom reprint requests should be addressed at: Laboratoire de 

Chimie Genetique, Faculte de Pharmacie, BP 24, F-67401 Illkirch, 

France, e-mail: behr@aspirine.u-strasbg.fr. 
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at room temperature for 2 h. Excess SMCC was removed with 
a Nick-Spin column (Amersham-Pharmacia) that had been 
equilibrated withPBS. The recovered oligonucleotide solution 
(100 /xl) was immediately reacted with a 10-fold molar excess 
of NLS-peptide (or mutated-NLS-peptide) overnight at room 
temperature and then stored at -20°C. The oligonucleotide- 
peptide conjugate was purified by preparative PAGE. The 
coupling yield was 30%, based on quantification of the radio- 
labeled oligonucleotides after migration in a 20% denaturing 
polyacrylamide gel. To assess the peptide content, 1 pmol of 
radiolabeled oligonucleotide-NH 2 or oligonucleotide-NLS was 
incubated in 10 mM Tris-HCl, pH 8/1 mM EDTA in the 
presence of proteinase K (0.5 mg/ml) for 10 min at 37°C, 
followed by incubation for 10 min at 65°C and subsequent 
loading on a denaturing polyacrylamide gel. Visualization and 
quantification were performed with a Phosphorlmager 425 
(Molecular Dynamics). 

Ligation of the CMVLuc Restriction Fragment to the Hair- 
pin Oligonucleotides. pCMVLuc plasmid, encoding the Pho- 
tinus pyralis luciferase under the control of the cytomegalovi- 
rus enhancer/promoter and followed by the simian virus 40 
early polyadenylation signal (a gift from B. Demeneix, Mu- 
seum National d'Histoire Naturelle, Paris) was propagated and 
purified as described (20). Xmnl/Xmal double digestion (10 
U/jag of DNA) was performed at 37°C for 2 h, followed by 
enzyme heat inactivation (65°C for 20 min), before Sail 
cleavage (37 D C for 2 h; 10 U/jug of DNA). After removal of the 
endonucleases by DNA precipitation, BspWL digestion was 
performed for 2 h at 37°C (10 U/jxg of DNA). Separation of 
the CMVLuc-containing DNA fragment (3,380 bp) from the 
shorter restriction fragments was performed by ultracentrifu- 
gation (30,000 rpm, SW41 rotor, 19 h at 25°C, Beckman 
Ultracentrifuge L8/55, France) in a 15-30% sucrose gradient. 

The 5' end of the oligonucleotide-peptide carrying the NLS 
or the mutated NLS peptide was radiolabeled with [7- 32 P]ATP 
and T4 polynucleotide kinase (1 U/pmol of oligonucleotide at 
37°C for 30 min). The 5' end of the oligonucleotide-cap was 
phosphorylated similarly with ATP. Excess [y- 32 P]ATP and 
ATP were removed with Microspin G-25 columns (Amer- 
sham-Pharmacia). Before ligation, the hairpin form of the 
oligonucleotides presenting a sticky 5' end was obtained by 
boiling and subsequently cooling the sample in ice. Ligation of 
the CMVLuc fragment with the oligonucleotide-cap and the 
oligonucleotide-peptide was performed overnight at 13°C with 
a 15-fold molar excess of each oligonucleotide and T4 DNA 
ligase (10,000 U/iig of DNA). The excess oligonucleotide was 
removed with a Microspin S-400 HR column (Amersham- 
Pharmacia). Quantification of the ligase reaction yield (80- 
90%) was performed by Cerenkov counting (TRI-CARB 2100 
TR Liquid Scintillation Analyzer, Packard). Capping of the 
CMVLuc fragment was checked by digestion with exonuclease 
III (10 U/jtig of DNA) at 37°C. Agarose gel electrophoresis 
showed the uncapped and hemicapped fragments to be totally 
digested, whereas the capped fragment remained undigested. 

Cells and Cell Culture. NTH 3T3 murine fibroblasts were 
purchased from American Type Culture Collection and grown 
in DMEM (GIBCO/BRL). BNL CL.2 murine hepatocytes 
were provided by E. Wagner (Bender, Wien, Austria) and 
grown in DMEM, high glucose (4.5 g/liter). HeLa human 
cervix epitheloid carcinoma cells (provided by L. Monaco, 
Istituto San Raffaele, Milano, Italy) were grown in MEM with 
Earle's salt (PolyLabo, Strasbourg, France). Human mono- 
cytes were isolated from the blood of a healthy donor by 
elutriation/cytapheresis and differentiated into macrophages 
by 1-week culture in RPMI 1640 medium supplemented with 
100 U/ml GM-CSF. Dorsal root ganglia neurons from new 
born rats were obtained and grown as described (21). Cell 
culture media were supplemented with 10% fetal calf serum 
(GIBCO/BRL), 2 mM L-glutamine, penicillin (100 units/ml), 



and streptomycin (100 /Ag/ml; GIBCO/BRL). Cells were 
maintained at 37°C in a 5% COj humidified air atmosphere. 

Cell Transfection. For each cell line used, 10,000 cells per 
well were seeded 24 h before transfection in 96-well tissue 
culture plates (Costar) to reach 60-70% confluence during 
transfection. Before transfection, cells were rinsed and 0.2 ml 
of fresh culture medium supplemented (transfection in the 
presence of serum) or not (transfection in the absence of 
serum) with 10% fetal calf serum was added to each well. The 
desired amount of DNA was diluted into 46 ju.1 (final volume) 
of 0.15 M NaCl or 5% glucose. The desired quantity of 
ExGen500, 25-kDa PEI, or Transfectam (from a 1 mM aque- 
ous amine nitrogen stock solution of PEI or a 2 mM ethanolic 
stock solution of Transfectam) was then added to the DNA- 
containing solutions, vortex-mixed gently, and centrifuged. 
After 10 min, volumes corresponding to 10, 20, or 200 ng of 
CMVLuc fragment (10 ng//xl of DNA) or to the gene-number- 
corrected amount of plasmid DNA were added to the cells. 
The cell culture dish was immediately centrifuged (Sigma 
product) for 5 min at 1500 rpm (280 x g) or 500 rpm for 
primary neurons. After 2-3 h, 20 ju,l of fetal calf serum was 
added to the serum-free wells. Cells were cultured for 24 h and 
tested for reporter gene expression. All experiments were done 
in duplicate. 

Luciferase Assay. Luciferase gene expression was measured 
by a luminescence assay. The culture medium was discarded 
and cell lysate was harvested after incubation of cells for 30 
min at room temperature in 100 {A of Lysis Reagent lx 
(Promega). The lysate was vortex-mixed gently and centrifuged 
for 5 min at 14,000 rpm at 4°C (Sigma 3K10). Twenty micro- 
liters of supernatant was diluted into 100 jxl of luciferase 
reaction buffer (Promega) and the luminescence was inte- 
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Fig. 1. Strategy for the preparation of a double-stranded DNA 
fragment coupled to an NLS peptide. A functional luciferase gene of 
3,380 bp was cut out of pCMVLuc with Xmal and Sail. Further 
digestion with Xmnl and Bspm cut the unwanted restriction fragment 
into small fragments (970, 875, 768, and 240 bp) that were removed by 
sucrose gradient centrifugation. The capped CMVLuc-NLS DNA was 
obtained by ligation of the 32 P-labeled (*) oligonucleotide-peptide and 
oligonucleotide-cap hairpins to the restriction fragment. 
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Fig. 2. Reaction scheme for the chemical coupling steps leading to the oligonucleotide-peptide conjugate (oligo-NLS). A hairpin oligonucleotide 
with a free alkylamino group in the T 4 loop (oligo-NH 2 ) was reacted with the heterobifunctional crosslinker SMCC to give a thiol-reactive maleimide 
oligonucleotide (oligo-Mal), which was in turn reacted with the C-terminal cysteinamide residue of the NLS dodecapeptide. 

grated over 10 sec (Mediators, Wien, Austria). Results were 
expressed as light units per mg of cell protein (BCA assay, 
Pierce). 

RESULTS 

Synthesis of a Capped Gene-Peptide Conjugate, Purifica- 
tion, and Proof of Structure. The construction was based on 
ligation of a pair of hairpin oligonucleotides to unique cohesive 
termini generated on the reporter gene (Fig. 1). Incorporation 
of 32 P into the modified oligonucleotide allowed us to follow 
the reaction kinetics, to purify the fragment of interest, and to 
verify its structure. The firefly luciferase reporter gene (Luc) 
flanked by the cytomegalovirus (CMV) enhancer/promoter 
sequence and the simian virus 40 polyadertylation signal was 
excised from the pCMVLuc plasmid (Fig. 1). Quadruple 
endonuclease digestion ensured straightforward large-scale 
separation of the 3,380-bp CMVLuc fragment from the re- 
maining <l-kbp fragments by ultracentrifugation through a 
15-30% sucrose gradient. T 4 loops are well suited for hairpins 
and the C-terminal thiol group of the PKKKRKVEDPYC 
peptide was conjugated to a thymine with a C(5) amino group 
(22) via an activated ester/maleimide bifunctional linker 
(SMCC) as detailed in Fig. 2. Preliminary experiments showed 
that the global reaction yield was highest after activation of 
oligonucleottde-NH 2 with SMCC for 2 h (significant parasitic 
maleimide hydrolysis may occur with time). The 5' 33 P radio- 
labeling and polyacrylamide gel electrophoresis of oligonucle- 
otides (Fig. 14) showed the peptide conjugation reaction to be 
completed after 3 h, giving 30% oligonucleotide-NLS, based 
on the starting oligonucleotide. Proteinase K digestion con- 
verted oligonucleotide-NLS to a faster migrating compound, 
presumably the oligonucleotide conjugated to the C-terminal 
amino acid (oligonucleotide-Mal-Cys), thus establishing the 
chimeric nature of the conjugate (Fig. 35). The oligonucleo- 
tide-NLS was purified by electrophoresis. The uncapped CM- 
VLuc fragment was then simultaneously reacted with a 15-fold 
excess of oligonucleotide-cap and oligonucleotide-NLS by 
using T4 DNA ligase in previously optimized conditions. 
Ligation reaction yield (80-90%) was assessed by quantitative 
radioactivity counting and full capping of the CMVLuc frag- 
ment was checked by 3' exonuclease digestion. CMVLuc-NLS 
was purified by gel permeation. 

Transfection with Small Amounts of DNA. The reporter 
gene construct was obtained by chemical/enzymatic synthesis 
and purified by PAGE/gel permeation, instead of being 
amplified in bacteria. Only limited amounts of DNA could, 




Fig. 3. (A) Synthesis of oligonucleotide-NLS. PAGE analysis of 
the formation of the oligonucleotide-peptide conjugate is shown. 
Aiiquots of the reaction mixture were taken at different times and 
analyzed on a 20% denaturing gel after radiolabeling of the oligonu- 
cleotides. Lanes: I, oligonucleotide-Nflb; 2, oligonucleotide-NH2/ 
SMCC reaction mixture after 2 h; 3-5, reaction mixture 1.5 h, 3 h or 
overnight after addition of the NLS-peptide to the oligonucleotide- 
NH 2 /SMCC mixture, respectively. (B) The presumed oligonucleotide- 
N1S conjugate is a substrate of proteinase K. Radiolabeled oligonu- 
cleotide-NH 2 or the crude oligonucleotide-NLS reaction mixture were 
digested with proteinase K. Products were analyzed on a 20% dena- 
turing gel and show total conversion of oligonucleotide-NLS into a 
faster migrating band, presumably oligo-Mal-Cys. 
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therefore, be obtained, and the transfection setup had to be 
miniaturized. A convenient 96-well microtiter plate assay 
developed by Feigner et al. (23) was chosen. Using several 
optimized cationic lipid formulations, these authors showed 
that transfection was best at 2-0.5 u.g of plasmid and quickly 
fell off below 0.25 /jtg. We confirmed this result with pCMVLuc 
and two other cell types (Table 1, entry 4, and Fig. 4), thus 
putting our ultimate goal to obtaining an effective transfection 
with less than 200 ng of DNA. As a preliminary test, we looked 
at the relevance to cap the reporter gene at both ends, thus 
avoiding free DNA ends that are prone to exonuclease- 
mediated DNA degradation and to eliciting a DNA repair 
response. To this end, the uncapped, hemicapped, and fully 
capped CMVLuc genes (200 ng) were transfected into hepa- 
tocytes. Luciferase activities (Table 1) showed both un- and 
hemicapped molecules to give 25-fold less gene expression 
than the capped one thus justifying the choice of our chemical 
strategy. 

The NLS Peptide Allows Effective Transfection with Minute 
Quantities of DNA. 3T3 cells were transfected with decreasing 
amounts of DNA — the NLS-bearing capped gene (CMVLuc- 
NLS), the capped gene (CMVLuc), and the corresponding 
mass-corrected amount of plasmid DNA (pCMVLuc). Al- 
though efficacy very much decreased for "20-ng" plasmid 
DNA (in fact 38 ng), as expected (see above), transfection 
levels remained remarkably high and constant over the range 
of 200-10 ng of DNA with CMVLuc-NLS (Fig. 4). Compar- 
ison with the capped gene lacking the NLS peptide showed 
100- to 1,000-fold more expression when the NLS peptide was 
present on the gene. Similar conclusions were obtained, by 
using a cationic lipid (Transfectam, Fig. 4 Left) or a cationic 
polymer (branched PEI, Fig. 4 Right). Further data (not 
shown) confirmed that other cationic carrier molecules be- 
haved similarly but also that efficacy dropped 50-fold within 
the range of 10-1 ng of CMVLuc-NLS. 

Improved Transfection Involves the Cellular Nuclear Im- 
port Machinery. Comparative experiments between CMVLuc 
and CMVLuc-NLS, although interesting, do not really allow 
one put forward any hypothesis concerning the mechanism. 
Nuclear import of DNA and oligonucleotides has mainly been 
followed with fluorescent tags and fluorescence in situ hybrid- 
ization, in microinjected or digitonin-permeabilized cells. 
Proof of active import relied on inhibition experiments with 
energy-decoupling molecules, nuclear pore-binding aggluti- 
nins, or mutations in the signal peptide sequence. Gene 
expression is an unambiguous proof of nuclear localization and 
transfection can be regarded as a straightforward way of 
introducing DNA into intact cells. One lysine to threonine 
mutation abolishes nuclear import (24). We therefore synthe- 
sized a capped gene (CMVLuc-mNLS) with a mutated PKT- 
KRKVBDPYC sequence. Comparative transfection experi- 
ments with HeLa cells are shown in Fig. 5. Irrespective of the 
amount of DNA used, the lysine to threonine mutation 

Table 1. Free DNA ends decrease efficacy 

Transfection efficiency, 
DNA form Symbol RLU/mg of protein 

Uncapped CMVLuc 

Hemi-capped CMVLuc 

CMVLuc 

Circular pCMVLuc 
plasmid 




9 ± 2 X 10 4 
2.5 ± 0.6 X 10 s 



2.5 ± 0.2 x 10 s 
3 ± 1 X 10 4 * 



BNL CL.2 hepatocytes were transfected with 200 ng of DNA per 
weU. Complexes were formed with 25-kDa PEI (N/P = 10) in 150 mM 
NaCl and in the absence of . serum. RLU, relative light units. 
"Transfection with 20 ng of plasmid. 
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Transfedam PEI 25 kDa 

1H 200 ng HI 20 ng □ 10 ng DNA/ well 

Fig. 4. NLS peptide promotes high and sustained transfection 
levels down to 10 ng DNA. 3T3 cells were transfected with decreasing 
amounts of DNA complexed in 150 mM NaCI to a cationic lipid 
(Transfectam, vector nitrogens over DNA phosphate, N/P = 6) or to 
a cationic polymer (25-kDa PEI, N/P = 10) in the absence of serum. 

brought transfection down to the level obtained with the 
capped gene having no peptide at all, confirming the func- 
tionality of the NLS-importin a interaction. 

NLS Peptide-Mediated Transfection Enhancement Is a 
General Phenomenon. A series of comparative experiments 
was undertaken with various cell types. Transfection enhance- 
ment due to the presence of the NLS peptide was always 
observed (Table 2). However, enhancement factors were 
spread widely with cell type (10- to 1,000-fold), with no obvious 
relation with tissue origin nor cell type (primary vs. trans- 
formed cells). Nondividing primary cells such as human mono- 
cyte-derived macrophages and rat dorsal root ganglia neurons 
showed less impressive enhancement than 3T3 and HeLa cells. 
However, similar values were also seen for the easily trans- 
fected and fast dividing rat hepatocyte-derived cell line BNL 
CL.2. 
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Fig. 5. Sustained luciferase expression levels are due to the nuclear 
localization peptide. HeLa cells were transfected with various amounts 
of DNA complexed to ExGen500 PEJ (N/P = 5 in a 5% glucose 
solution) in the presence of 10% serum. 
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Table 2. Average enhancement of transfection with CMVLuc-NLS 



HeLa 

Murine DRG a 
Human macrop 



4001-1000 
200*-300 



10 



Values refer to transfection with 10-20 ng of DNA per well, in the 
absence of serum. DRG, dorsal root ganglion. 
'Estimated spread is ±30%. 
fin the presence of 10% serum during transfection. 
$ln 24-well plates using 200 ng of DNA in the presence of 10% serum. 

The general experimental setup for transfection included 
96-well microliter plates and no serum for 2 h after addition 
of the DNA-vector complexes to the cells. Several experiments 
were also performed on a iarger scale (24-well plates) or in the 
presence of 10% serum during transfection. The results ob- 
tained for HeLa and 3T3 cells (Table 2) confirmed the general 
conclusions derived with the 96-well setup. 

Time Course of the Transgene Expression. Active transport 
of the reporter gene via the nuclear import machinery could 
result in faster appearing expression. More interesting, a lower 
intracellular barrier to gene delivery means less DNA-vector 
complexes required in the cytoplasm and hence lower toxicity 
and more sustained expression. The kinetics of gene expres- 
sion was followed in detail up to 24 h (Fig. 6) and then daily 
up to 3 days. Again, transfection with CMVLuc-NLS was much 
more effective than with CMVLuc-mNLS. Remarkably (and 
reproducibly), NLS-driven transfection reached its plateau 
after 12 h, whereas transfection levels obtained with the 
plasmid or the mutated NLS sequence were still increasing 
significantly up to 24 h. Previous experiments (25) showed both 
transfection and cytoplasmic injection of DNA-PEI complexes 
to have superimposable kinetics of transgene expression, sug- 
gesting that the slow step was intracellular trafficking/nuclear 
entry rather than cell entry. Using the nuclear import machin- 
ery seems to speed up the rate-limiting step. For longer time 
periods, expression stayed broadly constant with a low stan- 
dard deviation. However, the rather large variability observed 
upon repeating this experiment prevented us from drawing any 
conclusion (the inconsistent results observed at t > 1 day may 
be due to the fact that cells reached confluency at day 1). 

10 9 1 



10 e n 

p 10 
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hours 

Fig. 6. Reporter protein activity appears faster with CMVLuc- 
NLS. HeLa cells were transfected with 10 ng of DNA complexed to 
ExGen 500 PEI (N/P = 5 in 5% glucose solution) in the absence of 
serum. Cells were lysed and luciferase activity was measured at the 
indicated time after transfection. The absence of error bars indicates 
that the SEM is smaller than the label on the graph. 
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DISCUSSION 

The goal of this work was to improve nonviral plasmid- 
mediated gene delivery. When fully counterion-condensed, a 
single plasmid molecule collapses into a sphere of about 25 nm 
(26) that is already at the size-exclusion limit for signal- 
mediated nuclear import (4, 27). Plasmid condensation with 
cationic lipids or polymers generally leads to even larger 
multimolecular aggregates that reach the cytoplasm after 
binding to cell-surface anionic proteoglycans (9, 28) and 
eventual escape from the formed vacuoles (8, 9, 30). Because 
the particles are too large to cross an intact nuclear membrane, 
transfection of resting cells can reasonably only be accounted 
for by uncomplexed DNA that has been released by exchange 
with phosphatidylserine (31) or heparan sulfate (9) present in 
the vacuolar membrane. Any molecular information (e.g., 
NLS) borne by the cationic vector will thus be lost before 
reaching the nuclear membrane, hence its ineffectiveness. 

In sharp contrast to a condensed DNA molecule, a free 
hydrated DNA double helix (3 nm) is thin enough to enter the 
nuclear pore by diffusion, i.e., with no energy or signal 
requirement. Restricted intracellular motion of DNA [the 
cytoplasm is equivalent to 13% dextran (32)], and a rather low 
nuclear pore coverage [<10% of the membrane surface (33)], 
however, give this event a low probability to occur: the nuclear 
membrane has been recognized as a major barrier to gene 
delivery (8, 9). Quantitative cytoplasmic microinjection of 
DNA (25, 34, 35) indeed led to less than 0.1% of the DNA 
being expressed. The probability for DNA to find and subse- 
quently enter a nuclear pore can be increased by a bound ; 
karyophilic signal peptide able to dock DNA to the nuclear 
pore filaments and help an initial part of the molecule to cross 
the membrane. The DNA-karyophilic signal link should, 
however, be stable, hence the chemical strategy we chose to use 
in this study. 

The major achievement of our approach is that the amount 
of DNA required to effectively transfect cells in vitro has now 
shifted from the microgram to the nanogram range (see also 
ref. 17). Although the improvement (one to three orders of 
magnitude) varies with cell type (showing that the nuclear 
membrane is not the only possible barrier), even the modest 




Fig. 7. Hypothetical scheme of foreign DNA entering the nucleus. 
The vectorial movement is driven by DNA binding to chromatin- 
forming basic proteins. A single NLS peptide helps initial threading, 
whereas too many signals may inhibit full translocation. 
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enhancements observed for primary cell cultures look inter- 
esting (1,000%) when not presented on a logarithmic scale. Is 
this exciting result really a consequence of the involvement of 
the nuclear import machinery? We choose to tackle the 
academic aspect of this work by using the lysine to threonine 
mutation, which indeed abolished the beneficial effect brought 
about by the DNA-bound NLS sequence. 

Taking our hypothesis further, the last unanswered question 
deals with threading of the rest of the plasmid molecule (Fig. 
7). Random-walk diffusion of a micrometer-long molecule 
would be inefficient and slow. Fortunately, naked DNA will 
not remain free in the nucleus: histones (and eventually basic 
nuclear matrix proteins) indeed quickly assemble transfected 
DNA into chromatin-like structures (36, 37), thus providing a 
mechanism for pulling and condensing the filamentous mol- 
ecule into the nucleus. Following this line of thinking, many 
NLS signals distributed along the DNA may actually inhibit 
nuclear entry if the nucleic acid is longer than the distance 
separating adjacent pores (Fig, 7). A straightforward calcula- 
tion that uses the pore density in HeLa cells (33) shows this to 
become probable above 1 kbp. 

The present work is a proof of principle rather than a new 
gene delivery technology. However, it will hopefully provide 
the impetus for searching more versatile solutions to the link 
between a plasmid and a NLS peptide, such as triple-helix 
forming (38, 39) or strand-invading (40, 41) oligonucleotide- 
NLS conjugates. 
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Summary 

A short sequence of amino acids including Lys-128 
is required for the normal nuclear accumulation of 
wild-type and deleted forms of SV40 large T antigen. 
A cytoplasmic large T mutant that lacks sequences 
from around Lys-128 localizes to the nucleus if the 
missing sequence is attached to its amino terminus. 
The implication that the sequence element around 
Lys-128 acts as an autonomous signal capable of 
specifying nuclear location was tested directly by 
transferring it to the amino termini of /9-galactosid- 
ase and of pyruvate kinase, normally a cytoplasmic 
protein. Sequences that included the putative signal 
induced each of the fusion proteins to accumulate 
completely in the nucleus but had no discernible 
effect when Lys-128 was replaced by Thr. By reduc- 
ing the size of the transposed sequence we con- 
clude that Pro-Lys-Lys-Lys-Arg-Lys-Val can act as 
a nuclear location signal- The sequence may repre- 
sent a prototype of similar sequences in other nu- 
clear proteins. 

Introduction 

The nuclear and cytoplasmic compartments of eucaryotic 
cells contain largely distinct sets of proteins (Bonner, 1 975; 
De Robertis et al., 1978). Studies of the fate of endoge- 
nously synthesized and exogenously introduced proteins 
suggest that all nuclear proteins are synthesized in the 
cytoplasm, from which the mature polypeptides migrate 
rapidly to the nucleus (Gurdon, 197D; Craig and Perry, 
1971; Wu and Warner. 1971; Bonner, 1975; Feldherr, 1975; 
De Robertis et al., 1978; Yamaizumi et al., 1978), Thus the 
steady-state segregation of proteins between nucleus and 
cytoplasm appears to be governed by an intrinsic property 
of mature polypeptides. How the structures of proteins 
specify their distribution between the two compartments 
is not known. 

Passage of proteins across the membranes of the en- 
doplasmic reticulum, of mitochondria, or of chloroplasts is 
dictated by signal sequences of 20-40 contiguous amino 
acids (Btobel, 1980; Watson, 1984). Specific signals are 
not, however, necessarily required for passage across the 
nuclear envelope. For instance, many large polysaccha- 
rides and several proteins that are normally secreted or 
membrane-bound have been observed to enter the nu- 
cleus following introduction into the cytoplasm, and to 
enter at a rate apparently governed only by their size 
(Paine and Feldherr, 1972). Thus passage of proteins 



below M r 15,000 is not detectably retarded, a globular 
protein of M r 67,000 (BSA) enters nuclei only very slowly, 
and a protein of M r 450,000 (ferritin) is excluded (Paine et 
al., 1975). From such studies, the nuclear envelope has 
been proposed to behave as an array of cylindrical pores 
of internal diameter 7-11 nm (Paine et at., 1975; Peters, 
1 984). This model of the envelope corresponds reasonably 
well with the dimensions of discrete structures called 
nuclear pores, which have been purified from the nuclear 
envelope and analyzed by electron microscopy (Harris, 
1981; Unwin and Miiligan, 1982). 

It appears therefore that in vivo many small proteins may 
diffuse freely between nucleus and cytoplasm and accu- 
mulate in either compartment due to selective retention or 
differential degradation. However, some means of entry to 
the nucleus other than diffusion must exist for large pro- 
teins, notwithstanding the possibility that some such pro- 
teins might invade the nuclear area during mitosis, at which 
time the nuclear envelope breaks down transiently in most 
eucaryotes. It seems likely, therefore, that access of the 
majority of large proteins to the nucleus requires that they 
participate in a specific interaction with a cellular compo- 
nent involved in transport of proteins to the nucleus. The 
specificity of this interaction could then account for the 
selective distribution of such proleins between nucleus 
and cytoplasm. 

To date, two approaches have been taken to search for 
nuclear location signals in proleins. One approach as- 
sesses which parts of a protein are required for nuclear 
localization and is best exemplified by studies on the 
subcellular location of partial proteolytic digestion products 
of the nuclear protein nucleoplasm^ (Dingwall et al., 1 982). 
The other approach seeks to determine which parts of a 
nuclear protein are sufficient to promote nuclear accumu- 
lation. Thus, Hall and co-workers constructed chimeric 
fusion proteins to define which amino acid sequences of 
the yeast Mat o2 gene product sufficed to target E. coir 
/S-galactosidase to the nucleus in yeast (Hall et al., 1984). 
Here we have combined these approaches to demonstrate 
that a very short sequence of amino acids derived from 
SV40 large T is required for its normal nuclear localization 
and promotes the nuciear accumulation of an otherwise 
cytoplasmic protein, pyruvate kinase and also of a bacterial 
protein, 0-galactosidase. 

Results 

Sequences Required for Nuclear Location of 
Large T 

It has previously been shown that alteration of Lys-128 of 
SV40 large T to Thr or Asn (Kalderon et al., 1984; Lanford 
and Bute!, 1 984) prevents the normal nuclear accumulation 
of large T in established rodent and simian cells without 
grossly altering the stability of the prolein or its associated 
biochemical activities (Lanford and Butel, 1980; Paucha et 
al., submitted). Alteration of each of the basic residues 
adjacent to Lys-128 impaired but did not prevent nuclear 
accumulation. Other mutants with deletions bordering the 



basic tract of amino acids around Lys-128 were wholly or 
predominantly nuclear, leading to the suggestion that a 
very short sequence of amino acids including Lys-1 28 may 
act as an independent element within large T and be 
responsible for bringing about its normal nuclear localiza- 
tion (Kalderon etal., 1984). 

Here we have tested this hypothesis by constructing a 
series of mutants encoding proteins that lack sequences 
from the normal amino terminus or carboxy terminus, or 
from the regions on either side of Lys-128 in the primary 
structure of large T (Figure 1 and section 1 of Experimental 
Procedures). 

The subcellular localization of variant large T proteins 
was examined by immunofluorescent staining 18 hr after 
needle microinjection of plasmid DNA into Vero cells and 
also, for those mutants still able to transform Rat-1 cells 
morphologically, in cell lines derived from isolated trans- 
formed dense foci. The results showed that nuclear ac- 
cumulation was not affected by deletion of amino acid 
residues 1 to 82 (and perhaps 1 to 108) or 136 to 250 and 
was only slightly impaired by deletion of residues 98 to 
126 (Figure 2). Furthermore, proteins truncated from the 
carboxy terminus as far as residue 178 were also located 
wholly in the nucleus. This set of mutants therefore collec- 
tively deletes most of the primary sequence of large T 
while sparing the element around Lys-128 without visibly 
impairing the ability of the mutant proteins to accumulate 
in nuclei. 

However, as mechanisms of nuclear accumulation may 
differ among proteins (De Robertis, 1983) it is possible 
that wild-type large T and mutant forms of the protein 
distribute between nucleus and cytoplasm by different 
means. The nuclear accumulation of wild-type large T 
appears to depend on a mechanism involving Lys-128 
{Kalderon et al.. 1984). To test whether the observed 



subcellular location of each of the large T variants was 
also dependent on this mechanism, the location of equiv- 
alent proteins that included Thr rather than Lys at position 
128 was examined (sections 1 and 2 of Experimental 
Procedures). Several of these Thr-128-containing mutants 
gave rise to proteins that were detected only in the cyto- 
plasm (Figure 2), implying that the Lys-128 versions of the 
proteins depend on the same mechanism as wild-type 
large T for nuclear location. 

By contrast, the 272 amino acid long truncated version 
of d10 large T (encoding Thr at 128) accumulated in the 
nucleus to almost the same extent as its wild-type coun- 
terpart. Indeed, every other truncated version of d10 large 
T that we tested (of lengths 361, 437, ca. 540, and ca. 
620 amino acids), although predominantly cytoplasmic, 
was present to some extent in the nucleus. It is therefore 
clear from the difference in the distribution of d10 and wild- 
type versions of the truncated proteins that the Lys-1 28- 
dependent mechanism is operative in each case but it also 
appears that an additional means of nuclear accumulation 
is available to these proteins. It may be that the reduced 
size of the variants allows them to diffuse across the 



Thus a Lys-1 28-dependent mechanism contributed 
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Figure 1. Linker hserton/Delelion Mi 
The distribution of native restriction enzyme sites in the SV40 large T gene 
and 1he positions o( Insertion of a 10 bp Eco Rl Siter (CGGAATTCOG) in 
es of the Wild-type gerie.are showri. The insertion sites are 
. case denoted as the first SV4Q nucleotide preceding or fotowing 
er, in the numbering system of Tooze (1981). These enzyme sites 
sed as described rri section 1 of Experimental Procedures to 
ict ptesmids encoding deleted, duplicated, or Ironcated ve 
. Note that RL52 and 60 are not pn 



Figure 2- Structure and Subcellular Location of Large T Deletion Mutants 
The derivation of all plasmids whose coding potential and properties are 
displayed above is described in sections 1 and 2 of " 

by fhe mutant plasmids is shown. Thus the linker insertion/deletion mutants 
also include three or four amino acids encoded by the Eco Rl linker 
(CGSAATTCCG) inserted at the sites of the deletions shown. TL FVu II also 
encodes one and TL 148 two additional amino adds before termination of 
translation. The location of all mutant proteins was ascertained by indiect 
immunofluorescence as described previously (Kalderon et si., 1S64) using 
as first antibody either a monodonai that recognizes aminoterminal 
(PAb419) or carboxy-terminai (PAb423) regions of large T (Harlow et al., 
1981) 18 hr alter needle mrcrainjeclion of DNA into Vero ceffe. For those 
plasmids marked with an asterisk, the distribution of mutant protein was 
also examined by indirect irnmunofiuorescence using hamster anti-SV40 
tumor serum as first antibody in Rat-1 ceils transformed by »ie mutant 
protsris. The symbols N and C refer to complete nuclei 
localization; ~N indicates that the majority of protein is se 
Th only a trace in the cytoplasm; N>CandC>Nh< 
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either wholly or in part to the nuciear accumulation of all 
the deleted and truncated forms of large T examined. 
Because it operates in the context of so many variant 
structures, this result suggests that the sequence around 
Lys-128 does not rely on an interaction with other parts of 
the molecule to ensure the nuclear location of large T. 

Nuclear Accumulation of Large T Is Positively 
Promoted by Sequences Including Lys-128 

It has been assumed above that Lys-128 forms part of a 
positive signal that promotes nuclear localization, rather 
than Thr-128 inducing cytoplasmic retention of large T. 
The latter possibility seemed unlikely because a number 
of deletion mutants that remove Lys-128 and adjacent 
sequences also share the cytoplasmic phenotype (Kalde- 
ron et al.. 1984). Nevertheless, this possibility was exam- 
ined directly by constructing large T variants that contained 
two successive copies of a short sequence (amino acids 
127 to 135) around residue 128 with either Thr or Lys at 
this position (Figure 3). In every case where a wild-type 
sequence (including Lys at 128) was present, the variant 
protein localized exclusively to the nucleus, irrespective of 
whether this sequence was preceded or followed by se- 
quences derived from dlO large T (Figures 3, 4a, 4c, 4d). 
The same result was observed when the size of the 
duplication was increased to 21 amino acids (127 to 147), 
although in this case only one of the two orientations of 
wild-type relative to mutant sequence was examined (data 
not shown). Thus the presence of Thr-1 28 in this position 
relative to the normal amino or carboxy terminus of large 
T does not necessarily induce cytoplasmic retention, 
whereas nuclear accumulation is promoted in every case 
when Lys occupies position 1 28. 

The unimpaired nuclear accumulation of molecules with 
small duplications (Figure 4a, 4c, 4d) provides further 
evidence, similar to that derived from analysis of deletion 
mutants, that the function of the region around Lys-128 is 
not critically dependent on its position relative to the rest 
of the molecule. 

Relocation of Putative Nuclear Location Signal to 
the Amino Terminus of Large T 

The insertion/deletion mutant S11-S33 lacks native large T 
amino acid residues between 127 and 132 and fails to 
localize to the nucleus (Kalderon et al., 1984). We sought 
to restore a nuclear location to this protein by introducing 
at the amino terminus a short sequence of amino adds 
from around Lys-128 in wild-type large T (Figure 3 and 
section 4 of Experimental Procedures). Again, an analo- 
gous construction was made in which Lys-128 was re- 
placed by a Thr residue. The subcellular distribution of 
these proteins was examined foSowing microinjection of 
plasmid DJMA into Vero cells by immunof luorescent staining 
using a monoclonal antibody directed against the amino 
terminus of large T PAb419 (Harlow et al., 1981). Virtually 
all of the chimeric protein with an amino-terminal basic 
tract including Lys-128 was seen in the nucleus, whereas 
the Thr-containing large T variant was entirely cytoplasmic 




Figure 3. Structure and Subcellular Location of Large T Variants Containing 
Two Copies or One Relocated Copy of a Short Sequence around Residue 
128 

The construction of plasmids encoding large T variants in which sequences 
around Lys-128 were (a) duplicated or (b) transposed from their normal 
location to the amino terminus is described in sections 1, 3, and 4 of 
Experimental Procedures. The amino add sequence following Met-36 of 
L7RH/S11-S33 is identical wfji that following Met-1 of the linker insertion/ 
deletion mutant, S11-S33 (which lacks residues 127 to 132). The bcaton 
of variant proteins was assessed following microinjection of plasmid DNA 
into Vero cells by indirect immunofluorescence microscopy (see Figure 4). 



(Figures 4e and 4f). Thus a short sequence around Lys- 
128 is able to confer a nuclear location to large T even 
when moved by over a hundred amino acids within the 
primary structure. 

The tolerance of this sequence to transposition suggests 
that it constitutes an independent structural element but 
does not rule out the alternative possibility that its function 
depends on an interaction with other regions of the large 
T molecule. The hypothesis thai the sequence acts as an 
autonomous nuclear localization signal was therefore 
tested more rigorously by ascertaining whether it is suffi- 
cient to promote the nuclear accumulation of proteins that 
are normally found in the cytoplasm. Plasmids encoding 
chimeric proteins were therefore constructed in which the 
putative nuclear location signal was close to a terminus of 
the test protein where, it could be argued, there is most 
chance of maintaining a structure that is independent of 
the rest of the molecule. 

Localization of Chimeric fl-Galactosidase Proteins 

(3-gafacfosidase, encoded by the lac Z gene of- E. coli, 
was initially selected as a suitable test protein for several 
reasons. As a bacteria' protein it would not be expected 
to include specific signals that facilitate transport across 
the nuciear envelope, although it may have an affinity for 
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Figure 4. Subcellular Locations of Large T Variants Containing two Copies, or One Relocated Copy ot a Short Sequence around Residue 128 
Vera cells were micro-injected with plasmid DMAs encoding the variant large T molecules shown in Figure 3. Fctowing microinjection, cells were fixed and 
stained using mmodonals PAb423 (a to d) or PAb419 (e and f) (Harlow et el. 1981) as first antibody. The second antibody was rhodamine-bbeted rabbit 
anti-mouse IgG (Fabi fragment). Micrographs a to d represent large T variants containing two consecutive copies of the amino acid seqeunoa from 127 to 
135 with either Lys or Thr at position 12S, as follows (a) Lys-128 in both copies, L27-L7; (b) Thr-12Bin both copies, d10/L27-d10/L7; (C) Lys-128 followed by 
Thr-126, L27-dl 0/L7; (d) Thr-128 followed by Lys-128, d10/L27-L7. Micrographs (e) and (f) show the localiza:bn of large T derivatives in which amino acids 
127 to 147 including either Lys or Thr at position 128 have been added to the amino terminus of the cytoplasmic mutant S11-S33 as follows: (e) Lys at 
position 128, L7RH/S1 1 -S33;. (t) Thr at position 1 28, d1 0/L7RH/S1 1-S33. 



nuclear components common to procaryotic and eucar- 
yotic cells. The large size of the monomeric protein {M, 
116,000; Fowler and Zabin, 1977) and its propensity to 
oligomerize (Wallenfels and Weil, 1972) might be expected 
to preclude rapid entry to the nucleus in the absence of a 
specific transport mechanism. Furthermore, of the p-galac- 
tosidase fusion proteins thai have previously been con- 
structed, many retain enzyme activity (Casadaban et al., 
1983), indicating that additional amino acid sequences can 
be accommodated at either the amino or carboxy terminus 
of the protein without gross effect on its native structure. 
Finally, the same approach has been used previously to 
identify sequences that localize /3-gaiactosidase to the 
nucleus in yeast (Hall et al., 1984). 

Initially, four types of plasmid encoding /J-galactosidase 
fusion proteins were made (section 5 in Experimental 
Procedures). All contained the same promoter, translation 
initiation, and poiyadenylation signals (all derived from the 
SV40 early region) and the same jfJ-galactosidase coding 



sequence (Figure 5). They differed only in the number of 
SV40 large T amino acids that preceded the fusion junction 
with 3-galactosidase. The subcellular location of each of 
the fusion proteins was examined, following microinjection 
of the appropriate plasmid DNA into Vera cells, by indirect 
immunofluorescence using a cocktail of four monoclonal 
antibodies raised against E. coli /3-galactosidase (a gift 
from Dr. J. Partaledis). 

Surprisingly, the hybrid protein that included only four 
amino acids from the amino terminus ol SV40 large T 
(SRL30-BGAL, Figure 5) was seen at approximately equal 
intensities in both nuclear and cytoplasmic compartments. 
We obtained the same result with a hybrid protein, pCH1 1 0 
(Hall et al., 1983) that included 40 amino acids encoded 
by the E. coli gpt gene and 28 by the trp S gene at the 
amino terminus of jS-galactosidase. Similar results have 
been obtained by others for a series of amino- and car- 
boxy-terminal /8-galactosidase fusion proteins (D. P. Lane, 
personal communication). Thus, despite its large size and 
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Figure 5. Structure and Subcellular 
Plasmids encoding 




bacterial origin, E. coli /3-galactosidase appears intrinsically 
able to localize, at least partially, to the nucleus of higher 
eucaryotic cells. 

A similar distribution between cytoplasm and nucleus 
was seen for the fusion protein that includes 116 amino- 
terminal residues of SV40 large T. An additional 19 amino 
acids (up to 135), however, led to a strikingly different 
distribution. All of the fusion protein was now seen in the 
nucleus with no evidence of cytoplasmic slaining (Figures 
5 and 6a). An equivalent fusion protein that contained Tnr 
in place of Lys-128 was, by contrast, again found to be 
distributed between nucleus and cytoplasm (Figures 5 and 
6b). From this it appears that the conversion of a fusion 
protein consisting predominantly of /J-galactosidase from 
a partially nuclear to a totally nuclear phenotype can be 
achieved by sequences derived from SV40 large T, pro- 
vided Lys-128 and the immediately adjacent residues are 
present. 

The possibility that the same effect could be achieved 
by the sequence around Lys-128 in isolation was tested 



by inserting a small region of the SV4Q large T gene just 
downstream of the normal large T initiation codon and 
before the codon for amino acid 6 of /3-galactosidase 
(Figure 5). Again, the fusion protein was entirely nuclear if 
this short sequence (amino acids 127 to 147 of large T) 
included Lys at position 128 but was seen in both the 
nucleus and the cytoplasm if Thr replaced Lys-1 28 (Figures 
5, 6c, 6d). 

The /?-galactosidase fusion proteins clearly demonstrate 
the ability of a very short amino acid sequence including 
Lys-128 to promote the nuclear accumulation of that frac- 
tion of the protein normally found in the cytoplasm. How- 
ever, /3-galactosidase is somewhat unsatisfactory in this 
assay because the native protein (or, strictly, slight variants 
thereof), is not completely excluded from the nucleus. 

Localization of Pyruvate Kinase Fusion Proteins 

A series of experiments similar to those described above 
were conducted using chicken muscle type M1 pyruvate 
• kinase in place of /3-galactosidase. This protein shares 
some of the favorable characteristics listed above for &■ . 
galactosidase: the monomelic protein is reasonably large 
(theoretical M, = 58,000; Lonberg and Gilbert, 1983), the 
active form is teframeric (Cardenas et al., 1975), and 
antibodies are available that recognize the protein. Addi- 
tionally, the crystal structure of cat muscle pyruvate kinase 
has been determined (Stuart et al, 1979) and reveals thai 
the amino terminus of the protein is exposed and therefore 
provides a suitable environment for a putative nuclear 
location signal. 

A cDNA encoding chicken muscle pyruvate kinase (Lon- 
berg and Gilbert, 1983) was engineered so as to include 
the SV40 early region polyadenylation sequence following 
its normal termination codon and a unique Eco Rl site 1 7 
amino acids distal to the normal amino terminus of the 
protein (section 6 in Experimental Procedures). To this Eco 
Rl site were joined the SV40 early promoter and sequences 
encoding the first 4, 116, or 135 amino acids of SV40 
large T. The subcellular location of the pyruvate kinase 
fusion proteins was examined by indirect immunofluores- 
cence following microinjection of DNA into Vera cells, using 
an antibody raised against chicken muscle type M1 pyru- 
vate kinase (Lonberg and Gilbert, 1983). The fusion pro- 
teins that included 4 or 1 1 6 amino acids derived from large 
T (Figure 7) were detected throughout the cytoplasm but 
were absent from the nucleus of injected cells. In stark 
contrast, the fusion protein that included large T amino 
acids up to 135 was seen only in the nucleus of injected 
cells (Figure 6e). Only a cytoplasmic signal was seen if a 
Thr residue replaced Lys-128 in this construct (Figure 6f). 

The clear inference from these observations is that the 
same short sequence derived from SV40 large T that 
accentuates the nuclear localization of /3-galactosidase 
fusion proteins is responsible for changing the normal 
subcellular location of another protein, pyruvate kinase 
from the cytoplasm of the cell to the nucleus. This predic- 
tion was then tested directly and, at the same time, 
approximate limits were set on the length of large T amino 



6-galactosidase 



pyruvate kinase 




- ii jjP 



e m § * 

9 


5 * # 

* ** * 

* • • 










m 


■ 

* - * 







Figure 6. Subcellular Location of 0-Galaclosidase and Pyruvate Kinase Fusion Proteins 

Vero eels were microinjecled with plasmid DMAs encoding -fusion proteins ikistrated in Figure 5 (/J-galaorosiaase fusions) sold Figures 7 and 8 (pyruvate 
kinase fusions). Eighteen hours later the cells were fixed and the fusion proteins were visualized by imrnunofljorescence microscopy. Micrographs (a) to (d) 
represent /JgaJactosidase fusions, visualized with a mixture of munoclonals directed against jS^alactosidase fallowed by rrKjdamine-conjugated rabbit anti- 
mouse IgG. as follows, (a) L27-BGAL, (b) d10/L27-BGAL, (c) L7RH-BGAL, (d) d10/L7RH-BGAL (see Figure S). Micrographs (e) to (I) show pyruvate kinase 
fusions stained with a serum raised in rabbits against purified chicken muscle pyruvate kinase, and couiter-stamed with tluorescein-labeled goat anti-rabbit 
serum, as folows, (e) L27-PK, (f) d1 0/L27-PK. <S) XR4-PK, (h) XRdf 0/L27-PK, ® XR3WK, (j) XR22-PK, (k) m30-PJ-B, (1 ) xm30-PK (see Figures 7 and B). 

signals at their amino termini was, in principle, the same 
as that used for 0-galactosidase. End points, defined by 
restriction enzyme cleavage sites, were created in each of 
the three possible reading frames both following the amino- 
terminal five residues of iarge T and preceding residue 17 
of pyruvate kinase. Thus any small fragment of known 
nucleotide sequence could be inserted between appropri- 



Large T Amino Acid Sequences Sufficient for 
Nuclear Localization of Pyruvate Kinase 

The method used to construct plasmids encoding pyruvate 
kinase fusion proteins with short putative nuclear location 
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Figure 7. Structure and Subcellular Location of Pyruvate Kinase Fusion 

Plasmids encoding a variable number of amino acid residues derived from 
the amino terminus of large T fused fo pyruvate kinase {see sections 6 and 
7 of Experimental Procedures for ther construction] were microinjected Ho 
Veto cells. The drstribution o) fusion proteins was determined 18 hr later by 

chicken muscle type M1 pyruvate kinase followed by fluorescein-labeled 
goat antkaboit antibody. Although some background fluorescence was 
seen in uninfected cells, the fusion proteins were dearly either cytoplasmic 
(C) or nuclear (N) (see Figure 6, e-l). 

ate end points such that the normal large T and pyruvate 
kinase reading frames were maintained across both fusion 
junctions (details in Section 7 in Experimental Procedures). 
The small fragment was generated in the first instance by 
cloning the Hint I (4442)-Eco Rl (at 4413) fragment encod- 
ing either Lys or Thr at position 128. Expression of these 
proteins in Vero cells showed that amino acids 119-135 
of SV40 large T sufficed to promote complete nuclear 
accumulation of the pyruvate kinase fusion protein, 
whereas a similar sequence including Thr in place of Lys- 
128 showed no such capability (Figures 6g, 6h, and 7). 

The length of large T amino acid sequence preceding 
residue 135 was then progressively reduced by exonucle- 
ase digestion of the wild-type Hint I (4442)-Eco Rl (4413) 
fragment before insertion into the appropriate pyruvate 
kinase fusion vector. The amino acid sequences of these 
fusion proteins and their subcellular location are shown in 
Figure 8. SV40 large T amino acids from 126 to 135 (XR30 
FK, XR27 PK) promoted efficient nuclear accumulation of 
pyruvate kinase (Figure 6i) while 1 32 to 135 (XR22 PK) did 
not discernibly alter the cytoplasmic location characteristic 
of native pyruvate kinase (Figure 6j). We did not, however, 
observe a sharp transition between these two extremes. 
XR12 PK, which encodes iarge T amino acids 129 to 135 
gave rise to a protein that distributed between nucleus 
and cytoplasm. Exact interpretation of the large T amino 



acid sequence sufficient to act as a signal is complicated 
by the contribution that may be made to this activity by 
adjacent amino acids encoded by linker DNA sequences. 
Note in particular that the residue preceding 129 in XR 12 
PK is Arg, which is the only amino acid that has so far 
been found to be able to replace Lys-1 28 in large T without 
totally preventing accumulation in the nucleus (W. H. Col- 
ledge, personal communication). 

The distal end point of the nuclear location signal was 
refined further by taking advantage of an adjacent Mbo II 
site in order to introduce an Eco Rl site following the 
nucleotides encoding Val-132 (Section 7 of Experimental 
Procedures). Even this short signal extending from amino 
acid 126 to 132 was lound to be sufficient for nuclear 
accumulation (Figures 6k and 8). Exonuclease digestion 
from the Mbo II cleavage site allowed the generation of 
one more fusion protein that included large T sequences 
from resiude 126 to 130 and which was found to localize 
exclusively in the cytoplasm (Figures 61 and 8). Thus, taking 
into account possible contributions from linker sequences 
(see underlined residues in Figure 8), we believe that the 
minimal sequence of SV40 large T amino acids that suf- 
fices to promote efficient nuclear accumulation of pyruvate 
kinase is most probably 1Z6 Pro-Lys-Lys-Lys-Arg-Lys-Vai 13Z . 

Discussion 

Isolation of a Nuclear Location Signal 

Evidence has been presented to show that a short se- 
quence of amino acids around Lys-1 28 is required for the 
normal nuclear localization of SV40 large T and can suffice 
when appropriately integrated into the structure of cyto- 
plasmic proteins to promote their accumulation into the 
nucleus. This nuclear location signal fails to function in its 
norma! position in large T, or at the amino terminus of large 
T, 0-gatactosidase, or pyruvate kinase fusion proteins 
when Lys-1 28 (of large T) is replaced by a Thr residue. 
The signal therefore satisfies the condition of specificity 
that would be expected in order that distinct populations 
of proteins be maintained in cytoplasmic and nuclear 
compartments. 

it seems likely that the sequence of amino acids includ : 
ing Lys-1 28 itself constitutes a structure that is directly 
recognized as a nuclear location signal rather than inducing 
changes in the conformation of the rest of the polypeptide 
chain to which it is attached, since it has been shown to 
direct three different proteins to the nucleus- 
Sequence of Nuclear Location Signal 
The length of amino acid sequence sufficient to act as a 
nuclear location signal for pyruvate kinase has been deter- 
mined in this study to be approximately seven amino acids 
(126-132). This agrees reasonably well with estimates of 
the sequence requirements for the nuclear location of large 
T, as judged by the properties of both deletion and point 
mutants (Kalderon et al., 1984, and unpublished results), ' 

We already know from an analysis of the subcellular 
location of point mutants of large T that variations in the 
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Figure 8. Definition of the Minima) Nuclear Location Signal 

The entire an-iran-n in inn tijf l t cr with pyruvate kinase sequences is shown for fusion proteins that Include only short regions of large T 
sequence. The sequence that is directly derived from the region around Lys-128 in large-T is boxed in each case. Residues encoded by the linkers fhat 
surround this sequence are underlined where they are, fortuitously, identical with the amino add residue found in the equivalent position in large T. Most fusion 
proteins were either clearly cytoplasmic (G) or nuclear (N) but one indicated as N/C was detected in both compartments. 



minimal nuclear location signal can t 
without complete loss oi function and that the residue 
occupying position 128 is of prime importance (Kalderon 
et a!., 1984 and unpublished results). Definition of the 
exact parameters to which this sequence must conform 
should heip to elucidate the structural basis of the action 
of this signal and help to identify the occurrence of poten- 
tially active signals in proteins other than SV40 large T. In 
preliminary searches of other protein sequences we have 
found several nuclear proteins (e.g., adenovirus 72 kd 
DNA-binding protein, SV40, and polyoma capsid proteins) 
that include similar sequences to the large T prototype but 
only one (BK virus large T) that is identical. The functional 
activity of such homologous sequences may depend not 
only on their composition but also on their location within 
the protein. It will therefore be instructive to examine the 
efficacy of the large T signal when integrated at different 
sites within a protein of known tertiary structure. 

Is the Nuclear Location Signal a Nuclear Transport 
Signal? 

The existence of nuclear location signals was originally 
postulated to account for the transport of a specific subset 



of large proteins to the nucleus (De Robertis et a!., 1978). 
Although the nuclear location signal described here may 
fulfil this function we have defined its activity only by 
obsetving its effect on nuclear accumulation. Further ex- 
periments outlined below designed to test whether the 
signal accentuates the nuclear concentration of proteins 
by promoting transport are less conclusive. 

SV40 large T and the fusion proteins described above 
are sufficiently large that they would not be expected to 
accumulate in the nucleus as rapidly as observed, in the 
absence of a specific transport mechanism. An effect on 
transport is also indicated by the reduced rate of nuclear 
accumulation of large T variants with defective signals 
(Kalderon et al., 1984). Also, although it is not clear whether 
an association with a fixed nuclear component contributes 
to the accumulation of wild-type targe T in the nucleus, it 
is certain that the failure oi d10 large T to accumulate in 
the nucleus is not due to an inability to bind to SV40 origin 
DNA or the cellular protein, p53, since we know that it 
retains these activities (Paucha et al., submitted). 

Thus, although not compelling, the available evidence 
favors the possibility that the Lys-128 nuclear location 
signal acts by promoting transport to the nucleus of pro- 
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teins that are otherwise too large to penetrate the nuclear 
envelope. 



ty 

Large T in Wild-Type andrJJO (Lys-120-*Thr) Backgrounds 

Plasmids encoding large T variants were generated by using native restric- 
tion sties of the large T ganB and derivatives (Kakjeron and Smith, 1984) 
o( the wild-type gene in which a 10 bp Eco R linker has been inserted at 
the positions shown in Figure 1, 

Large T derivatives that encode two adjacent copies of the sequence 
from 127 to 135 separated by the coding information of an Eoo Rl Snker 
were constructed by ligation of the large Bam HI (2533)-Eeo Rl (4413) 
fragment of L27-S26 or d10/L27-S26 to the small Bam Hi (2533)-Eco Rl 
(4441) fragment of T23-L7 or T23<i10/L7. 

A synthetic self-complementary oligonucleotide of sequence CTAGT- 
TAACTAG that Includes an Hpa I site and encodes a termination oodon in 
all three reading frames, was inserted at the Pvu II site of the wild-type gene 
clones in plasmid pFWO (Kalderon et al., 1982) and of the mutant plasmid, 
d10 (Kalderon et at, 1984). The termination Snker was also inserted at the 
Hpa I site (3733) in both wild-type and d10 plasmids after addition of an 
Eco Rl linker to this site. Fnally, the termination linker was inserted into the 
Eco Rl sites of RL148, RL62. and RL50, ard for RL62 and RL60 was 
transferred to a d10 background by exchanging the Pvu ll-Bsm Hi fragment 
A termination codon was created at the Hind N site at 4002 as for the 
mutant dl 1001 (Rundell et al., 1977) by excision of the Hind 1114002-3476 
fragment from pPVU-0 and from d10, creating the sequence 4007 AAG CTT 
TAA™. Plasmids, referred to as EXON2 and d10/EXON2. theoretically 
encoding large T derivatives which initiate at Met-109 and which certainly 
include no information from the first exon of targe T, were made by addition 
of an Eco HI Inker to the Taq I (4739) sites of pPVUO and d1 0, respectively, 
n of this site to the Eco Ftl site of plasmid RL18, thus 
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The linker ire 

rh the d10 point m 
heteroduptex was formed between I 27526 and d10 as described previ- 
ously (Kalderon et al., 1982) and was treated with the minimal activity of 
S1 nuclease (about 10 u>g DNA/20 ^1) required to nick one or other but 
not both of the single-stranded loops in ttie heteroduptex. The criterion 
used in practice was the conversion of less than one quarter of the circular 
heteroduptex to a linear form. Following transfection into E08767 the nicked 
strands are removed and the gap repaired by copying the intact strand. 
Folowing resolution of the strands of the heteroduptex, the resulting 
molecules consist of one or other of the parental plasmids, wild-type 
plasmid, or the desired recornbinant. The cclonies were screened for the 
required plasmids containing both an Eco Ri site in the appropriate position 
and the d10 sequence at position 4435, by hybridization of Southern blots 
(Southern, 1975) of Pst l-Eco Rl-digested plasmid DNA to oligonucteotide 
probe of wild-type sequence from 4441-4423 at increasing temperature 
(Kalderon et al., 1984). Approximately one third of plasmids were found to 
be of the required structure. This was confirmed by DNA sequence 
Is T23-dt 0/L7 and dt0/L27-S26. This method is 
' " m of deletion and point ' 
>n of restriction sites and has also 
ping deletions. 

id T50-d1Q/L7 was made by replacing the -1800 bp Bam HI 
(2533)-Eco HI (4441) fragment of T50-L7 with the equivalent fragment from 
T23-dt0/L7. 

The Bam HI <2533)-Eco Rl (5150) fragment of RL142 (Figure 1) includes 
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promoter, and encodes the first four amino acid rt 
to SV40 small t and large T, The exact position of the Eco Rl site in RL142 
was refined by mild S1 nuclease digestion after cleavage with Eco Rl, 
followed by the addition of another (10-mer) Eco Rl linker to the newly 



s SR140 and SRL30 Foduced in this way 
aides 5154 and 5149, respectively 
le 5149 is now a C residue demmg from the Eco Rl linker 
in RL142). Thus DNA fragments ligated to the Eco Rl site can be read in 
each of the three posable reading frames by using the appropriate vector, 
RL142, SRL40, or SRL30. A simlar set of end points was derived from 
RL1 42, SRL30, and SRL40 by the addition of an Xho I linker (CCTCSAGG) 
to the filled*, unique Eoo Rl site, forming the plasmids XRL142, XSRL30, 
and XSRL40. Vectors that encode larger amino-tBrrriiratl segments of large 
T were made from the Bam H (2533)-Eco Rl fragments of the mutants 
T23-L7, L27-S26, and d10/L27-S26, each of which include a 10 bp Eco Rl 
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Small fragments of the large T gene encoding either amino acid residues 
119-135 or 127-147 with, kt each case, either Lys or Thr at position 12S 
were constructed from the plasmids L27-d10/L7 and d10/L27-L7 (Figure 

11 linker (OCAAGCTTGG) was added to all free ends, and, after digestion 
with Hind III and Eco Rl, the mixture of fragments was ligated to the large 
Hind lil-Eco Fd fragment of pBR328. The two required products of each 
fjgafjan were distinguished by hybridization at different temperatures to an 
wild-type sequence from 4441-1423 (corresponding to 
s127-132)andwe 
Eco Rl site. Purified Eco Rl (4441)-Hnd II (4376) fragments de 
two of these plasmids (L7RH and d10/L7RH) were ligated to the Bam HI 
(2533)-Eco Rl (51.49) fragment of SRL30 and the Hind III (5171)-Bam HI 
(2533) fragment of S1 1-S33 to torm chimeras in which the region around 
Lys-128 has been transposeo to the amino terminus of large T. Fse amino 
acid sequences of the chimeras is shown in Figure 3b. 
5) Construction of fl-Galactasidase Fusion Vectors 
The plasmids pSKS107 and pMC1871 (Casadaban st al., 1983) each 
include the majority of the E. coli lac Z gene encoding amino acid residue 
8 and onwards of (3-galactosidase. A Bam HI site is present just after the 
jj-galactosidase termination oodon in pMC1871 and the poMinker s& 
quences that precede the lac Z gene in pSKS1 07 are suitably disposed for 



a three-fragment ligation using the smaller Pst l-CIa I fragment from 
pSKS1 07, the smaHer Cta l-Bam HI fragment of pMC1871 , and the smaller 
Bgl II Pat I fragment of RL1 8:BGL2666. The latter plasmid was constructed 
by conversion of the Hpa I site (2666) of RL18 (Kalderon and Smith, 1984) 
to a Bgl II site using synthetic inkers. 

Suitable fragments, purified by agarose gel electrophoresis, were gen- 
erated from this vector by partial Bam H-Eoo Rl digestion for fusion witti 
Bam HVEco Rl fragments of vectors encoding the amiio-terminal regions 
of large T (section 3, Experimental Procedures) to lorm SRL30-BGAL, T23- 
BGAL, L27-BGAL, and d10/L27-BGAL (Figure 5). The Hind IB-Bam HI 
Iragment of BGAL-1 was used to construct the plasmids L7RH-BGAL and 
d10/L7RH-BGAL (Figure 5) by simultaneous ligation to the Bam HI (2533)- 
Eco Rl (51 49) fragment of SRL30 (section 3, Experimental Procedures) and 
the Eco Ri <4441}-Hind ill (4376) fragment from either L7RH or d10/L7RH 



6) Construction of Pyruvate Kinase Fusion Vectors 

The SV40 early region pdyadenylalbn signal, derived from the plasmid 
RL18;BGL 2666, was added to the 3' end of the cDMA of chicken mus- 
cle type Ml pyruvate kinase contained h plasmid pPK300 (constructed 
by N. Lonberg by joining plasmids pPK20T and pPK102 at their common 
BstE II site; Lonherg and Gilbert, 19B3). This was achieved by ligation of 
Bgl ll-Eco Rl fragments following the conversion of the unique Pvu II and 
Apa 1 sites of pPK3C0 to Ego Rl and Bgl il sites, respectively. Other vec- 
tors, RL18-PK10and RL1&FK12, were constructed in an analogous fashion 
but using Eco Rl linkers of 10 bp (CGGAATTCCG) and 12 bp 
(CCGGAATTCCGG), respectively, rather than 8 bp (GGAATTCC) to con- 
vert the Pvu II site of pPK300 to an Eco Rl site. An Eco FU-Bam HI fragment 
that contains codon 1 7 onwards of pyruvate kinase followed by a termina- 
tion oodon and a polyadenylation signal was purified from these plasmids 
and ligated to Bam Hi-Eco Rl fragments of the plasmids RL142, T23-L7, 



The small Hind IK-Eco Rl fragment of plasmid L27HR (section 4 of Experi- 
mental Procedures) that includes SV40 DNA sequences 445S-4413 was 
shortened by Bal 31 exonuctease digestion (1 U/4 fig DNA/20 iA for 10 
min at 30°C) from the Hind III site, followed by the addition of an Xho I 
linker (CCTCGAGG) to the digested ends and subsequent oircuiarkation. 
The positions of the Xho I linker were determined lor several products (XR4, 
9, 12, 15. 22, 24, 27, 30, 40> by sequencing the small Bam Hl-Eco Rl 
the Eco Rl site. Purified Xho t-Eco Rl 
■re incorporated into a suitable pyruvate 
ie Bam Hl-Xho I 

fragment of XRL142, XSRL30, or XSRL40 (section 3, 
dures) and the Bam Hl-Eco Rl fragment cl RL18PK8, RL18:PK10, or 
RL18:PK12 (section 6 of Experimental Procedures). A- simiar short se- 
quence derved from d10/L27HR was also incorporated into a pyruvate 
kinase vector In similar fashion after conversion of the Hind III site of d10/ 
L27HR to an Xho I site. 

The Xho l-Eco HI of XR30 was reduced further in size, keeping the Xho 
I site fixed In the following way. XR30 was digested with Mbo II and a 
portion was treated with Bat 31 exonuctease. In each case, the shortened 
450 bp fragment containing SV40 sequences adjacent to the Xho I site 
was purified by agarose gel electrophoresis after the addition of either 8 
bp, 10 bp, or 12 bp Eco Rl linkers. Subsequent to digestion with Ecp Rl 
and Xho I, the resultant mixture of fragments was used directly to clone the 
required shortened SV40 fragment into a pyruvate kinase fusion vector by 
three-fragment ligation, as described above. The DNA sequence from the 
Hind III (5171) site contributed by SV40 large T coding sequence up to and 
beyond the start of pyruvate kinase coding sequence (which includes all 
ligation junctions) was verified for all constructs shew in Figures 7 and 8. 
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previously (Kalderon et a!., 1984). The speciScilies of all antibodies used 
have been demonstrated previously PAb419 and PAb423 {Harlow et a!., 
1981); mouse anti-0-galactosidase antibodies (a kind gift of Dr. J. A. 
Partaledis, University of Massachusetts, Amherst); rabbit anti-chicken mus- 
cle type Ml pyruvate kinase (Lonberg and Gilbert, 1983); RITC-conjugated 
i'Jh m r ij (a kind gift of Dr. Roger Moms, Nalknial Institute tor 
Medical Research, London); FTTC-conjugated rabbit anti-hamster and goat 
ant-rabbit serum (supplied by Nordic Ltd.) and were verified in the experi- 
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HOS cells and as the chemistry of the interaction of simple 
methylating carcinogens such as MNNG is particularly well 
defined 4 *, a comparison of the met gene from MNNG-HOS 
cells with the parental HOS cell gene may provide a model 
system with which to study the mechanisms of transformation 
of human cells by this carcinogen. 

In at least one iV-nitroso-JV-methylurea (NMU) induced rat 
mammary tumour, activation of the ras" transforming gene 
involves a simple G ■* A point mutation 3 . Notably, this is pre- 
cisely the mutation that NMU would be predicted to induce if 
it reacted directly with the transforming gene. It will be interest- 
ing to se,e whether the precise method of activation of this 
transforming gene and of other transforming genes detected in 
chemically transformed cells also reflects the particular 



chemicals used to induce transformation. Further characteriz- 
ation of the met transforming gene and its comparison with the 
non-transforming allele may provide an interesting insight into 
one of the ways in which chemical carcinogens can activate 
proto-oncogenes in human cells. 
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Sequence requirements for nuclear location of 
simian virus 40 large-T antigen 
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A point mutation in the simian virus 40 large-T gene, which was generated by mixed oligonucleotide mutagenesis and 
resulted in the conversion of Lys 128 to Tkr, produced a large-T antigen that was detected in the cytoplasm but not the 
nucleus of cells. Deletions within the surrounding sequence Lys- ,zs Lys-Lys-Arg-Lys-Val~Glu also produce cytoplasmic 
large-T and define a region of the protein involved in nuclear location. 



The large-T antigen of simian virus 40 (SV40), molecular weight 
M r 94,000, is sufficient to transform established and non-estab- 
lished rodent cells in vitro and to induce tumours in newborn 
hamsters'. It is also required for the productive infection of 
simian cells by SV40 2 . 

Large-T is a predominantly nuclear protein 3 but is also found 
in a modified form in the plasma membrane 4 ' 5 . During a produc- 
tive infection the nuclear form of large-T regulates the level of 
viral gene expression and stimulates viral DNA replication 
through an interaction with the viral origin of DNA replica- 
tion . To investigate whether DNA binding by large-T is also 
involved in transformation we have introduced mutations 9 into 
a region coding for a putative DNA binding domain of the 



protein 6 ' 10 " 13 . Within this domain is a striking tract of five basic 
amino acids ( lz7 Lys-Lys-Lys-Arg-Lys l: ") just downstream from 
a cluster of serine and threonine phosphorylation sites 14 . As the 
sense of the codons for lysine is unaffected by the action of the 
commonly used mutagen, sodium bisulphite, to mutate this 
region we devised a modification of oligonucleotide-directed 
mutagenesis [S -' 6 that allows the introduction of several denned 
point mutations in a single experiment. 

The large-T protein encoded by one of the mutants generated 
in this manner contains a threonine residue in place of Lys 128. 
This mutant large-T, although able to transform Rat i cells with 
wild-type efficiency as measured by focus formation in high 
serum, accumulates in the cytoplasm rather than the nucleus of 



©1984 Nature Publishing Group 



NATURE VOL, 3U 




Fig. 1 Mixed oligonucleotide mutagenesis. Plasmid pPVU-0 (ref. 
23) contains the wild-type SV40 early region from BamHl (2,533) 
to Pvull (270) between the Bom HI (375) and Pvull (4,676) sites 
of pBR328 (ref. 37) but with a deletion of 3 base-pairs at the 
junction of Pvull sites, thus destroying the enzyme recognition 
site at this point. In the figure, plasmid DNA sequences are shaded 
whereas the SV40-derived sequences are not. SI 1-S33 (ref. 9) is a 
derivative of pPVU-0 that contains a 10 bp £coRI linker 
(CGGAATTCCG) spanning a deletion between SV40 nucleotides 
4441 and 4421. An Mspl site is created at the junction of linker 
and SV40 nucleotide sequences around position 4421. Afspl- 
BamHl and BamW-EcoKl restriction fragments from SII-S33 
were purified by agarose gel electrophoresis followed by phenol 
extraction* and were mixed in equimolar proportions with pPVU-0 
DNA linearized at its unique Pvull site. After denaturation with 
alkali and subsequent renaturation" about 10-20% of the products 
migrated as form II (open duplex circles) during agarose gel 
electrophoresis and were presumed to be circular heteroduplexes 
of the form shown above. The hybridization products (2.5 u,g DNA) 
were dissolved, after ethanol precipitation in 1 0 u,! of ligation buffer 
(50mM Tris-HCJ pH8.0, 7.5 mM MgCl 2 , 1 mM ATP, lOmM 
dithiothreitol (DTT) and 0.02% gelatin) and to this was added 
2.5 u,g of the oligonucleotide mixture (synthesized as in ref. 38) 
and phosphorylated by T4 polynucleotide kinase, also in 10 u,l of 
ligation buffer. The components were mixed initially at 60 °C, the 
temperature was then gradually reduced to 0 "C over a 6 h period, 
after which additional dithiothreitol (to 5 mM) and T4 DNA ligase 
(6 units) were added. After 1 h at 0 "C, ligation was continued for 
4h at !5°C. Some of the ligated material was then used directly 
for transfection. Alternatively, after phenol extraction and ethanol 
precipitation aliquots were treated with S, nuclease (50 units per 
10 uJ/0.4 u,g DNA for 20 min at 37 °C) "before neutralization and 
transfection. S[ nuclease treatment reduced the number of transfec- 
tants about 10-fold. Parallel experiments showed that significant 
cleavage of nicked circular plasmid DNA did not occur at the S, 




Fig. 2 Mutants obtained using mixed oligonucleotide 
mutagenesis. The composition of the synthetic oligonucleotide 
mixture is shown above together with the single-stranded region 
of the gapped heteroduplex, formed as in Fig. I, to which the 
oligonucleotides were hybridized. Note that after hybridization the 
synthetic oligonucleotides are exactly apposed at their 5* ends to 
the double-stranded region of the heteroduplex and can therefore 
be ligated directly into the heteroduplex. The nucleotide sequence 
alterations in mutant plamids recoverd from the procedure 
described in Fig. 1 are shown. The inferred alterations to the amino 
acid sequence of !arge-T are presented in Table t. The mutation 
found in plasmid d3 should not theoretically have been introduced 
by the oligonucleotide mixture used. It was not studied further 
because it does not alter the amino acid sequence of large-T. 

the transformed cells. The phenotypes of other mutants with 
lesions in the same area help to define a region around Lys 128 
which is required for the nuclear localization of large-T. 

Mixed oligonucleotide mutagenesis 

Oligonucleottde-directed mutagenesis is currently the method 
of choice for the introduction of single, precisely defined muta- 
tions. However, this method is not generally used for the creation 
of large families of point mutants because of the labour or 
expense involved in obtaining a large number of individual 
synthetic oligonucleotides. In the procedure described here a 
mixture of oligonucleotides synthesized on a single resin was 
used as the mutagen. 

The mixture of oligonucleotides, each of 20 residues, was 
synthesized by including an equimolar mixture of all four 
nucleotides at five selected positions along its length (Fig. 2). 
The degenerate positions correspond to the second nucleotides 
in the codons for the basic sequence i27 Lys-Lys-Lys-Arg-Lys m , 
thereby potentially allowing the conversion of each of these 
residues to any of three different amino acid residues. Thus the 
oligonucleotide mixture contained 4 s components, one of which 
was an exact copy of the wild-type SV40 large-T coding sequence 
in this region and 15 of which contained a single base change. 
The oligonucleotide mixture was incorporated into plasmid 
DNA bearing a cloned wild-type large-T gene by direct ligation 
into a gapped heteroduplex, formed between wild-type plasmid 
DNA and two restriction fragments of a linker insertion/deletion 
mutant derivative of the plasmid (Fig. 1 ). n 

After transfection, colony hybridization r using a 19-nucleo- 
tide long probe of sequence corresponding to the wild-type gene 
in this region (4,423—4,441 :ACCTTTCTCTTCTTTTTTG) was 
used to detect plasmids bearing mutations within this sequence. 
Washing filters at increasing temperatures 18 following hybridiz- 
ation at room temperature clearly distinguished between wild- 
type plasmids in which hybrids were stable up to about 55 °C 
(in 6XSSC) and others which showed no detectable hybridiz- 
ation beyond about 45 °C. Although this latter class could itself 
be resolved into two distinct categories because of differences 
in the stability of hybridization to the oligonucleotide probe, 
plasmid DNA was isolated from all poorly hybridizing colonies 
and examined by restriction enzyme mapping. All plasmids for 
which hybrids were not stable above ~37 °C, contained large 
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Table 1 Biological properties of mutant large-T proteins produced by mixed oligonucleotide mutagenesis 



Focus formation on Rat-1 cells Replication Location of 

large-T in 

10>gDNA 1 ugDNA Plasmid Virus transformed 



Cytoplasmic 
Cytoplasmic 
Mixed 
Mixed 



(2) 33 (A) 31 Dt Dt Mixed 
(1) 56 (A) 37 + + Nuclear 

(3) 24 (D) 74 + + Mixed 

(4) 40 (C) 103 



Transformation assays: Plasmid DNA <10u.g or I u.g+9 u« salmon sperm DNA) was iransfecKd into semi-confluent Rat-1 cells by the calcium phosphate method 
Cells were grown in Dulbecco's modified Eagle's medium supplemented with 10% fetal calf serum. Dense foci were visible after about 14 days post-transfection and 
were counted 3-9 days after their first apperance. The numbers above represent the mean of duplicate assays expressed as a percentage of the number of foci induced 
by the wild-type plasmid, pPVU-0, in the same experiment. The numbers of fod induced by pPVU-0 in these experiments were as follows: Experiment (1 ) 1,220; (2) 
1,150; (3) 446: (4) 500; (5) 464; (6) 694; (7) 634. (A) 170; (B) 94; (C) 50; (D) 76; (E) 62. The deletion mutants RL6 and S5-S32 (ref. 9) which encode a wild-type 
small-t but severely truncated large-T proteins of 175 and 142 amino acids, respectively, failed to induce any foci in these assays. 

Replication assays: Plasmid UNA and vira! replication assays were performed as described elsewhere*. (-) Indicates that no DNA was detectable in an experiment 
that would detect 0.1% of wild-type DNA accumulation or that no plaques were detectable within 4 weeks (wild-type plaques are normally visible by 10 days): ( +) 
indicates behaviour indistinguishable from wild-type and (D) (defective) behaviour intermediate between these extremes. 

Subcellular localization: Cells derived from transformed foci of Rat-1 cells were grown on glass coverslips, fixed in 4% formaldehyde in phosphate-buffered saline <20 
min at room temperature), permeabilized with 1% Nonidet P40 (5 min at room temperature) and incubated with hamster anti-tumour serum followed by fluorescein 
isothiocyanate-labelled rabbit anti-hamster antibody (Nordic). dlOand XI2 transformed cells were also examined using monoclonal antibody PAb423 (ref. 21), followed 
by rhodamine isothiocyanate-conjugaled rabbit anti-mouse antibody. Controls using normal Rat 1 cells and normal hamster serum were included. Normal Rat-t cells 
were also generally still present in the population of cells derived from dense foci and thus acted as internal controls for non-specific staining, immunofluorescence was 
scored as nuclear, cytoplasmic or mixed. Mixed indicates that three different patterns of staining could be seen in individual cells within a population of transformed 
cells. Some cells were stained only in the nucleus, others only in the cytoplasm and the rest in both compartments. The most extreme example of this phenotype is dl. 

n by staining with the large-T specific monoclonal antibody PAb423 following manual microinjection of each of the mutant plasmids 

wild-type, t Less than 10% of wild-type DNA accumulation, t Plaques visible 6 days later than wild-lxpe. 



deletions (about 2 kb) that included the target for mutagenesis. 
In contrast, all plasmids for which hybrids were stable to 
between 40 °C and 45 °C were of wild-type size, indicating the 
presence of point mutations within the target area. This was 
confirmed by determining the DNA sequence in this region of 
ail plasmids which formed hybrids of intermediate stability, 
using the oligonucleotide primed dideoxy procedure applied to 
double-stranded DNA 19 . 

Nine of the eleven plasmids sequenced in this way were found 
to have single point mutations of the wild-type sequence at 
positions corresponding to the degeneracies in the oligonucleo- 
tide mixture (Fig. 2). One mutant plasmid (dl) contained two 
such alterations and another contained an A to C transversion 
at a position that should have been invariant (and wild-type) 
in the oligonucleotide mixture. Two independent isolates of one 
mutant plasmid (XI2; dlO) were recovered. In a subsequent 
experiment using the same oligonucleotide mixture, three further 
single point mutants were isolated (W. H. Colledge, personal 
communication). Thus, 12 of the 15 possible point mutants have 
been obtained. Mutants were recovered corresponding to each 
of the degenerate positions in the oligonucleotide mixture and 
examples of each kind of mutagenic change ( A-+ C, G, T) were 
found. The overall efficiency of production of point mutants 
was enhanced from 0.7% to 3% by the inclusion of an S, 
nuclease digestion step immediately before transfection, de- 
signed to linearize those gapped heteroduplexes to which no 
oligonucleotide had hybridized. The ease with which large num- 
bers of plasmids can be reliably screened for the presence of 
point mutations by hybridization to an oligonucleotide of wild- 
type sequence compensates for the relatively low rate at which 
mutant plasmids are produced. The genotype of the mutants 
recovered demonstrates that the mutagenic procedure selects 
for the incorporation of synthetic oligonucleotides containing 
single deviations from the wild-type sequence, from a large 



excess of oligonucleotides with multiple alterations. The prin- 
ciple of mixed oligonucleotide mutagenesis should be applicable 
to all methods of conventional oligqnucleotide-directed 
mutagenesis currently in use 16 . 

Transforming activity 

The transforming potential of mutant plasmids was assayed by 
measuring the induction of dense foci on a monolayer of Rat- 1 
cells following calcium phosphate transfection 20 . The formation 
of dense foci was large-T dependent, as shown by the failure 
of deletion mutants that encode a wild-type smatl-t but severely 
truncated large-T proteins, to produce dense foci (Table 1). 
However, all point mutants examined showed a transforming 
activity similar to wild-type. Although slight differences in 
specific transforming activity were observed, the latent period 
before foci became visible was never significantly different from 
wild-lype (13-14 days). 

Subcellular localization 

Individual foci of transformed Rat-1 cells were picked, the cell 
population expanded and the subcellular location of large-T 
assessed by indirect immunofluorescence of permeabilized fixed 
cells using serum from hamsters bearing SV40-induced lumours 
as the first antibody. 

Rat-1 cells transformed by wild-type large-T, encoded by the 
plasmid pPVU-0, exhibited a typical nuclear staining with no 
detectable nucleolar or cytoplasmic staining (Fig. 3a). A strik- 
ingly different staining pattern was seen for cells transformed 
by the point mutants, dtO and XI 2, both of which encode a 
threonine residue in place of Lys 128 (Fig. 3c). The nuclei appear 
dark against a bright cytoplasmic background. An identical 
pattern was seen using a monoclonal antibody (PAb423: ref. 
21) directed to the carboxy-terminal region of large-T as first 
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Fig. 3 Nuclear accumulation of large-Tin transformed RaM cells and in Vero cells is abolished by alteration of Lys 128 to Thr and impaired 
by mutation of the surrounding basic amino acid residues. Individual dense foci of RaM cells transformed by wild-type pPVU-0 (panel a), 
the double point mutant dl (Lys 1 29-* Met, Lys 131 -*Thr) (panel b) and the point mutant dlO (Lys 128 Thr) (panel c) were picked and 
expanded. Cells transformed by dl were then cloned successively by passage through soft agar and by dilution. Cell lines were fixed and 
stained by indirect immunofluorescence as described in the legend for Table 1, using hamster anti-tumour serum as the first antibody. The 
location of farge-T in Vero cells was examined by the same indirect immunofluorescence procedure using the monoclonal PAb423 as first 
antibody, 18 hr after manual microinjection" of about 200 copies of plasmid DNA into the nucleus of confluent cells growing on glass 
coverslips. The plasmids injected were wild-type pPVU-0 (panel cf), dl (panel e) and dlO (panel /). The fields shown in d-f are or confluent 
monolayers and include some cells that have not been microinjected and show no immunofluorescent staining. 



antibody. This antibody, unlike anti-tumour serum, does not 
recognize SV40 smail-t antigen which should be expressed in 
wild-type form by all mutant plasmids used in this study. The 
same cytoplasmic staining pattern was seen in Rat-1 cells trans- 
formed by d 1 0 that were isolated as colonies growing in soft agar. 

The exclusion of large -T from the nuclei of transformed Rat-1 
cells consequent to mutation of Lys 128 is not observed for 
similar mutation of individual adjacent amino acid residues that 
form part of the five residue basic tract {' 27 Lys-Lys-Lys-Arg- 
Lys' 31 ). Cells transformed by each of these mutants exhibit 
predominantly nuclear staining. In some cases (d2, X8, X46), 
the subcellular distribution of large-T appears indistinguishable 
from that of the wild-type protein. In other cases (X7, X18, d!3, 
d27), a slight defect in nuclear accumulation of large-T is 
manifest by the presence of some cytoplasmic, in addition to 
nuclear, staining in the majority of cells derived from an 
individual focus. Furthermore, a minority of cells from the same 
focus show predominantly cytoplasmic staining. 

The heterogeneous staining pattern (Fig. 3£>) is even more 
obvious for the double mutant, dl (Lys 129^-Met, Lys 131 -» 
Thr). To date, the subcellular distribution of d 1 large-Thas been 
examined in 16 foci, derived from two independent transforma- 
tion assays. In only one of these was a homogeneous staining 
pattern observed, with the mutant large-T apparently confined 
to the cytoplasm of the transformed cells. In all other cases, 
nuclear staining exceeded cytoplasmic staining in the majority 
of cells (60-95% ) ; some cells clearly exhibited both nuclear and 
cytoplasmic staining, whereas in others large-T was apparently 
either wholly cytoplasmic or wholly nuclear. Cells derived from 
three of these foci were cloned further either by limiting dilution 
or by passage through soft agar. In each case, the heterogeneous 
staining pattern was preserved, indicating that clonally derived 
sibling cells express a large-T antigen that can, at any one time, 
be present predominantly in either cytoplasmic or nuclear com- 
partments. 

The variable distribution of the large T protein encoded by 
the mutants dl, d!3, d27, X7 and XI 8 shows that mutations 
close to Lys 128 can modulate but do not prevent the accumula- 



tion of large-T in the nucleus. Although other interpretations 
are possible, the heterogeneous localization of large-T in cells 
transformed by these mutants may reflect a deficiency in the 
rate of nuclear accumulation less extreme than that observed 
for d 10, which allows the visualization of intermediate stages 
in the nuclear accumulation of large-T, for example following 
the breakdown of the nuclear envelope at mitosis. 

Transient expression 

The wild-type plasmid, pPVU-0, and mutant derivatives were 
manually microinjected into the nucleus of semi-confluent Vero 
celts 22 and the location of the encoded large-T products assessed 
by indirect immunofluorescence 1 8 h post-injection. The staining 
patterns observed were virtually identical to those seen for 
transformed cells but were of much greater intensity, presumably 
due to the large number of gene copies available for transcrip- 
tion. Thus, large-T coded by wild-type pPVU-0 was exclusively 
nuclear (Fig. 3d) and that encoded by dlO was seen only in the 
cytoplasm (Fig. 3/). Both nuclear and cytoplasmic staining was 
observed for the mutants dl, dl3, d27, X7 and XI 8 (Fig. 3c) 
but the heterogeneity in this staining pattern between individual 
cells in a population was not as marked as for transformed cell 
tines. 

Replicative activity 

As all mutations were introduced into a plasmid, pPVU-0 (ref. 
23) that contains the viral origin of DNA replication as well as 
the entire large-T gene, the ability of the mutant large-T proteins 
to stimulate the replication of viral origin-bearing episomes in 
permissive cells could be tested directly by transfection of mutant 
plasmids into CV-1 cells and quantitation of intracellular plas- 
mid DNA 72 h later 24 . In addition, the mutant large-T coding 
sequences from these plasmids were reconstructed into viral 
DNA and the viability of mutant virus tested by plaque assay 25 . 
The two assays gave similar results (Table i), indicating that 
several of the mutants were unable to support viral DNA replica- 
tion. Many other mutants' with lesions either side of those 
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Fig. 4 Amino acid sequence of large-T proteins with deletions in 
the vicinity of Lys 128. Mutants were constructed from wild-type 
pPVU-0 containing one or two EcoRI linkers CGOAATTCCG at 
the site of a small deletion 9 . The consequent inferred alterations 
to the wild-type amino acid sequence are shown. Subcellular 
localization of mutant large-T antigens in transformed cells was 
determined as in Table 1 and Fig. 3. ( +) ( - ) indicate the presence 
or absence, respectively, of nuclear immunofluorescence. Some of 
the large-T coded by T23-L7 was also detected in the cytoplasm. 
The box above the amino acid sequence of large-T indicates those 
residues which cannot be deleted without preventing nuclear 
localization of the mutants described here. Abbreviations used are: 
A, alanine: D, aspartic acid: E, glutamic acid; F, phenylalanine; 
G, glycine; H, histidine; I, isoleucine; K, lysine; L, leucine; N, 
asparagine; P, proline; Q, glutamine; R, arginine; S, serine; V, 
valine. 

described here also appear to be unable to stimulate viral DNA 
replication, indicating that this is a common phenotype of 
mutants from this region. 

The viability of mutants altered at Lys 129 and Lys 13 t, 
respectively, implies that these (heterogeneously localized) 
mutant large-T antigens must be able to interact effectively with 
DNA sequences around the viral origin of replication. Direct 
measurement of DNA-binding activity in vitro by the McKay 
assay 26 suggests that the non-viable cytoplasmic (dlO) and par- 
tially cytoplasmic (dl) mutants can also recognize SV40 origin 
DNA <E. Paucha, D.K. and A.E.S., manuscript in preparation), 
suggesting that the aberrant localization of these proteins is not 
related to their ability to interact with SV40 DNA. 

Sequences around Lys 128 

The data above show that alteration of Lys 128 and, to a lesser 
extent, the surrounding basic amino acid residues either modu- 
lates or abolishes the nuclear accumulation of large-T. The 
properties of a large number of single, double and triple point 
mutants with lesions either side of this region between amino 
acids 106 and 158, by contrast, do not exhibit any impairment 
in the nuclear accumulation of large-T 9 . The importance of Lys 
128 to the localization of large-T is highlighted further by the 
recent observation of R. G. Lanford and J. S. Butel (personal 
communication) that the only lesion in the large-T coding 
sequence of the PARA-cT variant 27 , isolated several years ago 
on the basis of its cytoplasmic phenotype, is the conversion of 
the codon for Lys 128 to that for Asn. 

It is likely, therefore, that Lys 128 forms an essential part of 
some structural feature of the wild-type large-T protein that 
allows it either to be transported to or retained within the 
nucleus. The other possibility that the variant large-T proteins 
coded by dlO and PARA-cT have acquired an affinity for cyto- 
plasmic components seems improbable, in that a novel specific- 
ity of binding is unlikely to be endowed by two different amino 
acid changes. 

The context in which Lys 128 serves to localize large-T to the 
nucleus has been probed further by examining the properties 
of a number of insertion/ deletion mutants with lesions in the 
immediate vicinity of Lys 128 (Fig. 4). These were made by 
standard procedures as part of a more comprehensive mutagenic 
analysis of large-T 9 . As all of the mutants shown in Fig. 4 were 
able to transform Rat-1 cells (Table 2), the subcellular distribu- 
tion of the mutant large-T antigens could be examined in cells 



Table 2 Transforming activity and subcellular localization of large-T 
proteins with deletions in the vicinity of Lys 128 









Transformation of Rat-1 
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Net change 




No. of 


No. of 
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amino acid 


Nuclear 








residues 


arge- 


iO^gDNA 


1 tig DNA 








81 


44 


S1-S32 


0 




3, 20, 35* 


30 


S1-S28 


+1 




30 


40 


S17-S36 






66, 50 


36 


S17-S29 






38, 53 


17 


S14-S33 










S14-S38 


+2 








S11-S33 


-3 




72 


75 


S11-S38 


0 




110 


35 


T23-L7 


-6 
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The transforming activity and subcellular locations of the mutant 
large-T proteins in transformed cells is given. These parameters were 
assayed as described in Table I. 

* These foci appeared 7, 3 and 6 days later than wild -type, respectively. 



derived from transformed foci. In all cases where any part of 
the basic tract of amino acid residues (127-131) is interrupted, 
even if this does not include Lys 128, large-T is seen only in 
the cytoplasm of transformed cells (Fig. 4). However, the normal 
nuclear accumulation of large-T was not detectably impaired 
by the alteration of as many as four consecutive amino acid 
residues distal to Glu 133 and was only marginally affected by 
changing as many as ten residues proximal to Lys 127. Thus, if 
there exists a region of primary amino acid sequence around 
Lys 128 that directs large-T to the nucleus, it probably does not 
extend beyond Lys 127 or Glu 133. The tolerance of the nuclear 
large-T mutants L27-S26 and T23-L7 to variation in the length 
of polypeptide chain separating this sequence from the rest of 
the molecule perhaps also suggests that the function of Lys 128 
is not dependent on a precisely structured three-dimensional 
configuration within the protein. The cytoplasmic location of 
several different insertion/ deletion mutants supports the argu- 
ment presented earlier that this phenotype does not result from 
a novel retention of large-T in the cytoplasm but rather from a 
defect in transport to or retention within the nucleus. 

All of the cytoplasmic insertion/ deletion mutants, like dlO, 
transform Rat-1 cells according to dense focus formation in high 
serum with an efficiency and latent period similar to wild-type 
(Table 2). 

Transformation by non-nuclear T 

SV40 large-T is an unusual transforming protein in that it can 
transform primary cells as well as established cell lines 28 and 
gives rise to a gradation of transformed phenotypes 29 . In this 
study only the ability of large-T to transform an established cell 
line, Rat- 1, in high serum was measured by the criterion of 
dense focus formation. None of the cytoplasmic large-T deletion 
or point mutants showed a significant deficiency in transforming 
activity relative to wild-type in this assay. Preliminary experi- 
ments indicate that the point mutatit, dlO, also transforms Rat-1 
cells, by the criterion of colony formation in soft agar at high 
serum concentration with an efficiency similar to wild-type. 
However, any implication that large-T need not be nuclear to 
transform, must be tempered by two considerations. First, 
although large-T is seen by immunofluorescence only in the 
cytoplasm of cells transformed by dlO or by any of several 
deletion mutants, some large-T may nevertheless be present in 
the nucleus but remain undetected, either because of its very 
low abundance or because it is in a form that is not accessible 
to the antibodies used for its detection. Second, it is possible 
that differences in the transforming activity of nuclear and 
non-nuclear large-T species may be revealed by more stringent 
assays. By using such assays, R. G. Lanford and J. S. Butel 
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concluded that the cytoplasmic large-T encoded by the PARA-cT 
mutant induces a partially transformed phenotype in hamster 
cells, although it can induce colony formation in semi-solid 
medium and tumour formation in vivo* 0 . In view of the notion 
that the transformation of primary cells involves the combined 
action of a nucelar immortalizing gene product (such as polyoma 
virus large-T, adenovirus Ela protein, the myc gene product) 
and a membrane-associated protein 35 (such as the ras gene 
product, polyoma virus middle-T), it might be predicted that a 
non-nuclear large-T would be unable to immortalize or trans- 
form primary cells, whereas the plasma-membrane associated 
form of large-T 5 may suffice to transform established cell lines. 
These predictions are being tested using the cytoplasmic mutants 
described here. 

Mechanisms of nuclear localization 

The nuclear envelope differs from other cellular membranes in 
that it allows the free passage of virtually all proteins below a 
certain size (corresponding to approximately 18,000 M r ) in addi- 
tion to some larger proteins 32-34 . Thus, for any given protein it 
is not clear whether nuclear accumulation results from a specific 
mechanism that either facilitates or drives transport across the 
envelope or whether the protein is free to difiuse, for instance 
through nuclear pores and is then retained in the nucleus by 
virtue of an affinity for elements that are themselves confined 
to the nucleus. This distinction has not been made for SV40 
large-T, though the large size of the monomer may favour a 
specific transport mechanism. 
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The accumulation of nucleoplasms in the nucleus of Xenopus 
laevis oocytes 55 is the only case for which there is good evidence 
for the involvement of a nuclear transport system. Proteolytic 
digestion of the native pentameric protein has shown that a 
16,000 M r fragment derived from a single 33,000 M r monomelic 
unit is sufficient to allow the nuclear accumulation of partial 
proteolytic cleavage products as large as 101,000 M T , and is 
therefore presumed also to specify the nuclear location of the 
native protein. 

Whatever the mechanism of nuclear accumulation, the 
evidence presented here heralds the possibility that a very restric- 
ted region of the SV40 large-T amino acid sequence around Lys 
128 may be sufficient to ensure the nuclear location of this 
protein. Further experiments are in progress to determine more 
precisely to what extent this putative signal is independent of 
the rest of the large-T molecule and whether the signal can act 
in the context of other proteins that are normally cytoplasmic 
so as to direct them to the nucleus. The availability of the dlO 
mutant will be invaluable as a control for such studies. When 
this manuscript was in preparation, M. N. Hall et at. reported 
that restricted sequences derived from the yeast Mat a2 gene 
product can direct another protein to the nucleus in yeast 3 *. 
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The shape of the boundary of the Solar System, defined as the 
surface within which the gravitational attraction of the Sun, rather 
than that of the rest of the Galaxy, controls the orbital motion 
of bodies such as planets and comets, has been determined. Outside 
of this surface, the dominant factors are the radial tides due to 
the galactic centre and, especially, the vertical tides caused by the 
galactic disk. Orbits which are direct with respect to the galactic 
plane, have a boundary which differs from that for retrograde 
orbits, both being 10-20% oblate and both larger than the, usually 
assumed, present Oort cloud. The surface may have been the 
boundary of the early cloud of comets which was later reduced by 
the passages of stars and molecular clouds. 



Losses of bodies from the Solar System are caused by: 
sporadic perturbations by passing stars or giant molecular 
clouds, which occur at a rate of several per 10 6 or I0 9 yr respec- 
tively, perturbations by giant planets and the steady gravitational 
perturbation by the galaxy. The first two kinds of perturbations 
have been extensively studied 1 ^. We now describe the results 
of a three-dimensional stability study (inertial coordinates) of 
a system consisting of a comet and the Sun rotating 5 around 
the galactic centre [mass 1.3 x 10" M Q , distance 8.2 kpc (ref. 6)] 
and periodically traversing the nearly harmonic field of the 
galactic mid-plane (GMP) 7 . A two-dimensional model of a 
greatly simplified version of this problem has been studied 
previously 5 and a corresponding three-dimensional Hill surface 
is also known 8 . To get an indication of the extent and structure 
of the boundary of stability 9 paths of comets were integrated 
numerically for various initial parameters using Co well's method 
and fourth-order Runge-Kutta integration. Near the boundary 
of stability the orbit of a comet is non-keplerian, its semimajor 
axis a, average inclination i, eccentricity e and the direction of 
perihelion (5 are highly variable. Thus, for an initial i, the values 
of e and a were so chosen as to obtain an orbit which was 
bound for the age of the Solar System. The value of m had 
essentially no effect on the results. 
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